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Abstract 

This is an expository account of the edge eigenvalue distributions in 
random matrix theory and their application in multivariate statistics. 
The emphasis is on the Painleve representations of these distribution 
functions. 
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1 Multivariate Statistics 



1.1 Wishart distribution 



The basic problem in statistics is testing the agreement between actual obser- 
vations and an underlying probability model. Pearson in 1900 [2Z1 introduced 
the famous X 2 test where the sampling distribution approaches, as the sam- 
ple size increases, to the X 2 distribution. Recall that if Xj are independent 
and identically distributed standard normal random variables, N(0, 1), then 
the distribution of 



xi 



x 2 + 



+ x 2 



has density 



fn{x) 



i T 

2™/ 2 r(n/2) ^ 





^/2-l p -x/2 



for x > 0, 
for x < 0, 



;i-2) 



where T(x) is the gamma function. 

In classical multivariate statistics 1 it is commonly assumed that the un- 
derlying distribution is the multivariate normal distribution. If X is a p x 1- 
variate normal with E(X) = p and p x p covariance matrix X = cov(X) : = 
E ((X — p) ® (X — p)) 2 denoted N p (p, S), then if E > the density function 
of X is 



fxix) = (2vr)- p / 2 (detS)- i/z exp 



-1/2 



a; e 



where (■, ■) is the standard inner product on W. 

It is convenient to introduce a matrix notation: If X is a n x p matrix (the 
data matrix) whose rows Xj are independent N p (p, S) random variables, 



X 



/ 



V 



x x 
x 2 

x„ 



\ 



then we say X is N p (l <g> p,I n ® E) where 1 = (1, 1, . . . , 1) and is the n x n 
identity matrix. We now introduce the multivariate generalization of (jl.lj) . 

1 There are many excellent textbooks on multivariate statistics, we mention Anderson 
PQ, Muirhead [2E|j and for a shorter introduction, Bilodeau and Brenner 0]. 

2 If u and v are vectors we denote by u <g> v the matrix with (i, j) matrix element UiVj. 
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Definition 1.1. If A = X t X, where the n x p matrix X is N p (0,I n (g) S) ; 
then A is said to have Wishart distribution with n degrees of freedom and 
covariance matrix £ . We write A is W p (n, E) . 

To state the generalization of (|1.2|) we first introduce the multivariate 
Gamma function. If 5+ is the space of p x p positive definite, symmetric 
matrices, then 

r p (a) := / e" tr(A) (det A) a ~ (p+1)/2 dA 

where Re(a) > (m — l)/2 and dA is the product Lebesgue measure of the 
\p{p + 1) distinct elements of A. By introducing the matrix factorization 
A = T l T where T is upper-triangular with positive diagonal elements, one 
can evaluate this integral in terms of ordinary gamma functions, see, page 
62 in Note that V\{a) is the usual gamma function T(a). The basic fact 
about the Wishart distributions is 

Theorem 1.2 (Wishart |38j). If A is W p (n, £) withn > p, then the density 
function of A is 

e-5 ^ 1A ) (det A) {n ~ p - 1)/2 . (1.3) 

2P«/ 2 r p (n/2) (detS) n/2 

For p = 1 and E = 1 (jl.3j) reduces to (jl.2|) . The case p = 2 was obtained 
by Fisher in 1915 and for general p by Wishart in 1928 using geometrical 
arguments. Most modern proofs follow James j2H|- The importance of the 
Wishart distribution lies in the fact that the sample covariance matrix, S, is 
W p (n, ^E) where 

i N _ _ 

S := - ( x i ~ x ) ® ( x i ~ X) , N = n + 1, 

U 3=1 

and Xj, j = 1, . . . , N, are independent N p (/i, E) random vectors, and X = 

Principle component analysis, 3 a multivariate data reduction technique, 
requires the eigenvalues of the sample covariance matrix; in particular, the 

3 See, for example, Chap. 9 in [23, and [221 f° r a discussion of some current issues in 
principle component analysis. 



4 



largest eigenvalue (the largest principle component variance) is most impor- 
tant. The next major result gives the joint density for the eigenvalues of a 
Wishart matrix. 

Theorem 1.3 (James [21J). If A is W p (n, E) with n > p the joint density 
function of the eigenvalues £i, . . . , £ p of A is 



7Y 



p 2 /2r)-pn/2 sr\- n / 2 p 



3 

3=1 j<k 

■htTCZ- 1 HLH t ) 



T p (p/2)T p (n/2) 

■ f e-W-* ULM '> dH (1.4) 
Jo(p) 

where Q(p) is the orthogonal group ofpxp matrices, dH is normalized Haar 
measure and L is the diagonal matrix diag(£i, . . . ,£ p ). (We take t\ > £ 2 > 

■■■> £ P .) 

Remark 1.4. The difficult part of this density function is the integral over the 
orthogonal group 0(p). There is no known closed formula for this integral 
though James and Constantine (see Chap. 7 in j2H] for references) developed 
the theory of zonal polynomials which allow one to write infinite series ex- 
pansions for this integral. However, these expansions converge slowly; and 
zonal polynomials themselves, lack explicit formulas such as are available 
for Schur polynomials. For complex Wishart matrices, the group integral is 
over the unitary group U(p); and this integral can be evaluated using the 
Harish-Chandra-Itzykson-Zuber integral 

There is one important case where the integral can be (trivially) evalu- 
ated. 

Corollary 1.5. IfT, = I p , then the joint density \l-4\j simplifies to 

*£5Sr ffl '-(-to) n«-* m 



1.2 An example with S 7^ cl p 

This section uses the theory of zonal polynomials as can be found in Chap. 7 
of Muirhead [26] or Macdonald [23j . This section is not used in the remainder 
of the chapter. Let A = (Ai, . . . , A p ) be a partition into not more than p 
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parts. We let C\{Y) denote the zonal polynomial of Y corresponding to A. 
It is a symmetric, homogeneous polynomial of degree | A | in the eigenvalues 
. . . , y p of Y. The normalization we adopt is defined by 

(trY) k = (y 1 + ... + y p ) k = £ C X (Y). 

l(X)<p 

The fundamental integral formula for zonal polynomials is 4 
Theorem 1.6. Let X, Y e S+ , then 

C x (XHYH t )dH= Cxi ^f ) (1.6) 

where dH is normalized Haar measure. 

By expanding the exponential and using (jl.fij) it follows that 

/ exp (ztr(XHYH>)) iff = £ £ £ ^fff ■ (1.7) 



fc>0 ai-* 

£(A)<p 



We examine ()1.7|) for the special case (|p| < 1 
E = (l-p)/ p + pl® 1 



/ 1 p p ••• p \ 
p 1 p ••• p 



\P P P ■■■ 1 

We have 

5T 1 = (1 - py 1 L - w 9 , — 1(8)1 

1 P) P (l-p)(l + (p-l)p) 

and 

detE = (l-p) p - 1 (l + (p-l)p). 

For this choice of E, let Y — al ® 1 where a = p/((2(l — p)(l + (p — l)p)), 
then 

oo) Vo(p) 

A? ^(7 (fc) (al®l)(7( fe) (A) 



g 2(l-p) £j A J 



^ee, for example, Theorem 7.2.5 in |2fij . 
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where we used the fact that the only partition A h k for which C\(Y) is 
nonzero is A = (k). And for this partition, C^){Y) = a h p k . Define the 
symmetric functions g n 5 by 

H(l-x 1 y)- 1 / 2 = J29n(x)y n , 

j>X n>0 

then it is known that [2*3] 

Using the known value of C^){I p ) we find 

where (a)k = a(a + 1) • • • (a + k — 1) is the Pochammer symbol. 



2 Edge Distribution Functions 

2.1 Summary of Fredholm determinant representations 

In this section we define three Fredholm determinants from which the edge 
eigenvalue distributions, for the three symmetry classes orthogonal, unitary 
and symplectic, will ensue. This section follows [!JTJ [33J [3?""| ; see also, "To"[ IT"""] . 

In the unitary case (/3 = 2), define the trace class operator K 2 on L 2 (s, 00) 
with Airy kernel 

T ^ , . Ai(ar) Ai'(y) - Ai'(x) A%) f°° ... .... SJ , niN 

K M (x,y):=—^- — — — = / Ai(x + z) Ai(y + z) dz (2.1) 

x ~~ y Jo 

and associated Fredholm determinant, < A < 1, 

D 2 (s,X) = det(I-XK 2 ). (2.2) 
Then we introduce the distribution functions 

^2 00 = F 2 (s, 1) = D 2 (s, 1), (2.3) 

°In the theory of zonal polynomials, the g n are the analogue of the complete symmetric 
functions h n . 
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and for m > 2, the distribution functions F 2 (s,m) are defined recursively 
below by lEESI) . 

In the symplectic case (j3 = 4), we define the trace class operator K4 on 
L 2 (s, 00) © £ 2 (s, 00) with matrix kernel 



(2.4) 



where 



_ 1 / 5 4 (x,y) SD 4 (x,y) 

1 Z" 00 
5 4 (x,y) = K A i(z, - - Ai(x, y) J Ai(z)dz, 

SD 4 (x,y) = -d y K M (x,y) - ~ Ai(ar) Ai(y), 

POO /»oo /* oc 

IS 4 (x,y) = - K A i(z,y)dz+- / Ai(z) dz ■ / Ai(z), 

1/ x ^ J x J y 

and the associated Fredholm determinant, < A < 1, 

-D 4 (s, A) = det(J - XK 4 ). (2.5) 

Then we introduce the distribution functions (note the square root) 

F 4 (s) = F 4 (s, 1) = 1) , (2.6) 

and for m > 2, the distribution functions F 4 (s,m) are defined recursively 
below by (jSHJ). 

In the orthogonal case ((3 = 1), we introduce the matrix kernel 



*!(*,»):=( rc/ 1 ^, ^ ) (2-7) 

1 ISi{x,y)-e(x,y) Si(y,x) 1 



where 
Si(x,y) 



K M (x, y)~\ Ai(x) ^1 - y M(z) dz^j 



SDi(x,y) = -d y K A i{x,y) - - A\(x) Ai[y), 



poo / px poo roc \ 

IS t (x,y) = -J K Ai (z,y)dz + - Ai(z) dz + J Ai(z) dz ■ J Ai(z) dzj 



e{x-y) = -sgn(x-y). 



The operator K\ on L 2 (s, oo) © L 2 (s, oo) with this matrix kernel is not trace 
class due to the presence of e. As discussed in one must use the weighted 
space £ 2 (p) ©L 2 (p -1 ), G L 1 . Now the determinant is the 2-determinant, 

-Di(s, A) = det 2 (J - A KiXj) (2.8) 

where Xj is the characteristic function of the interval (s, oo). We introduce 
the distribution functions (again note the square root) 

F 1 (s)=F 1 (s,l) = y/D 1 (s, 1), (2.9) 

and for m > 2, the distribution functions Fi(s,m) are defined recursively 
below by (|3.11j) . This is the first indication that the determinant Di(s,X) 
might be more subtle than either D2(s, A) or D^s, A). 



2.2 Universality theorems 

Suppose A is W p (n,I p ) with eigenvalues l\ > ■ ■ ■ > i p . We define scaling 
constants 



/^ = (vV^I +v /p) 2 , a np = (v/^I + v /p) ^_L= + .J_j 



1/3 



The following theorem establishes, under the null hypothesis £ = I p , that 
the largest principal component variance, £\, converges in law to F\. 

Theorem 2.1 (Johnstone, [22j ) . Ifn,p oo such thatn/p — * 7, < 7 < 
00, £/ien 

^1 - ^ ® 



np 



Fi(s, 1). 



Johnstone's theorem generalizes to the m largest eigenvalue. 

Theorem 2.2 (Soshnikov, [29j ) . Ifn,p — > 00 such thatn/p — > 7,0 < 7 < 
00 , i/ien 



a 

np 



Fi(s, m), m — 1, 2, 



Soshnikov proved his result under the additional assumption p = O^p 1 ^ 3 
We remark that a straightforward generalization of Johnstone's proof 
together with results of Dieng ^U] show this restriction can be removed. 
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Subsequently, El Karoui [13] extended Theorem 12.21 to < 7 < oo. The 
extension to 7 = 00 is important for modern statistics where p ^> n arises in 
applications. 

Going further, Soshnikov lifted the Gaussian assumption, again establish- 
ing a F\ universality theorem. In order to state the generalization precisely, 
let us redefine the n x p matrices X = {xij} such that A = X l X to satisfy 



1. E(2Cy) = 0, E(x 



2. The random variables have symmetric laws of distribution. 

3. All even moments of Xij are finite, and they decay at least as fast as a 
Gaussian at infinity: E(x^ m ) < (const m) m . 

4. n-p= 0(p 1/3 ). 
With these assumptions, 
Theorem 2.3 (Soshnikov, [25]). 

— > rAs, m), m = 1, 2, 

a 

np 

It is an important open problem to remove the restriction n — p = Ofj? 1 / 3 ). 

For real symmetric matrices, Deift and Gioev [Hj, building on the work 
of Widom jH7j, proved F\ universality when the Gaussian weight function 
exp(— x 2 ) is replaced by exp(— V(x)) where V is an even degree polynomial 
with positive leading coefficient. 

Table 01 in Section El displays a comparison of the percentiles of the F\ 
distribution with percentiles of empirical Wishart distributions. Here tj de- 
notes the j th largest eigenvalue in the Wishart Ensemble. The percentiles 
in the £j columns were obtained by finding the ordinates corresponding to 
the .Fi-percentiles listed in the first column, and computing the proportion 
of eigenvalues lying to the left of that ordinate in the empirical distributions 
for the £j. The bold entries correspond to the levels of confidence commonly 
used in statistical applications. The reader should compare Table H3to similar 
ones in [H1I22]. 
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3 Painleve Representations: A Summary 



The Gaussian /3-ensembles are probability spaces on iV-tuples of random 
variables {£%, . . . , £n}, with joint density functions Pp given by 6 



-N) 



P 



Cf ] exp 



N 

E 



j<k 



(3.1) 



The C™ are normalization constants, given by 



N 



°/3 



7T 



-N/2 2~N-P N(N-l)/4 . TJ ^ ~^ 7) + 2 ) 



r(i + fj; 



(3.2) 



By setting /3 = 1, 2, 4 we recover the (finite N) Gaussian Orthogonal Ensem- 
ble (GOEat), Gaussian Unitary Ensemble (GUEjv), and Gaussian Symplectic 
Ensemble (GSEjv), respectively. For the remainder of the chapter we restrict 
to these three cases, and refer the reader to for recent results on the gen- 
eral f3 case. Originally the £j are eigenvalues of randomly chosen matrices 
from corresponding matrix ensembles, so we will henceforth refer to them as 
eigenvalues. With the eigenvalues ordered so that 



> 



define 



V2N 



(3.3) 



2-1/2 jy-i/6' 

to be the rescaled m th eigenvalue measured from edge of spectrum. For the 
largest eigenvalue in the /3-ensembles (proved only in the (3 = 1,2,4 cases) 
we have 

t[ N) ^ l u (3.4) 

In many places in the random matrix theory literature, the parameter (3 (times 1/2) 
appears in front of the summation inside the exponential factor i|3.1|) . in addition to 
being the power of the Vandermonde determinant. That convention originated in |24j . 
and was justified by the alternative physical and very useful interpretation of as 
a one-dimensional Coulomb gas model. In that language the potential W — \ J2i $ — 

'EiKj^lh - h\ and p f\t) = Cexp(-WVfcT) = Cexp(-pW), so that = (kT)- 1 
plays the role of inverse temperature. However, by an appropriate choice of specialization 
in Selberg's integral, it is possible to remove the (3 in the exponential weight, at the cost of 



We choose the latter convention in this work 



redefining the normalization constant C\ 
since we will not need the Coulomb gas analogy. Moreover, with computer simulations 
and statistical applications in mind, this will in our opinion make later choices of standard 
deviations, renormalizations, and scalings more transparent. It also allows us to dispose 
of the \[2 that is often present in F4 . 
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whose law is given by the Tracy-Widom distributions. 6 
Theorem 3.1 (Tracy, Widom [3T1 

o 

(x — s) q 2 (x)dx 



F 2 (s) = P 2 (4 <s)= exp 

= Pi(li < s) = (F 2 (s)) 1/2 exp 
F 4 (s) = P 4 (£i < s) = (F 2 (s)) 1/2 cosh 



1 Z" 00 

"ay. 

~2 



q(x)dx 
q(x)dx 



(3.5) 
(3.6) 
(3.7) 



The function g is the unique (see [HI E3) solution to the Painleve II 
equation 

q" = xq + 2q 3 , (3.8) 

such that q(x) ~ Ai(a;) as a; — > oo, where Ai(x) is the solution to the Airy 
equation which decays like 1 7r -1 / 2 a;~ 1//4 exp (— | x 3//2 ) at +oo. The density 
functions fp corresponding to the Fp are graphed in Figure El 7 Let F 2 (s,m) 




Figure 1: Tracy-Widom Density Functions 

denote the distribution for the m th largest eigenvalue in GUE. Tracy and 
Widom showed 0T] that if we define F 2 (s, 0) = 0, then 



F 2 (s, m + 1) — F 2 (s, m) 



\ i 1 1 



ml d\ m 



D 2 (s,\) 



A=l 



m > 0, 



(3.9) 



7 Actually, for (3 = 4, the density of F^V^s) is graphed to agree with Mehta's original 
normalization 24 as well as with . 
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where (12. 2 j) has the Painleve representation 



D 2 (s, A) = exp 



(x — s) q 2 (x, \)dx 



(3.10) 



and q(x, A) is the solution to (|3.8j) such that A) ~ \f\ A\(x) as x — ► oo. 
The same combinatorial argument used to obtain the recurrence (13 .9|) in the 
(3 = 2 case also works for the (3 = 1, 4 cases, leading to 



i^s, m + 1) - m) = 1 J Dj /2 (s, A) I A=1 , m > 0, /3 = 1, 4, 



i\m Jn_ 
4 ) d nV2/ 

a, a; | a=1 , »t ^ u, p — 

(3.11) 

where i^s, 0) = 0. Given the similarity in the arguments up to this point 
and comparing ()3.10|) to ()3.5jh it is natural to conjecture that Dp(s, A),/? = 
1,4, can be obtained simply by replacing q(x) by q(x, A) in (|3.6|) and (|3.7|) . 

That this is not the case for (3=1 was shown by Dieng ^01 • A hint 
that /3 = 1 is different comes from the following interlacing theorem. 

Theorem 3.2 (Baik, Rains |3J). In the appropriate scaling limit, the dis- 
tribution of the largest eigenvalue in GSE corresponds to that of the second 
largest in GOE. More generally, the joint distribution of every second eigen- 
value in the GOE coincides with the joint distribution of all the eigenvalues 
in the GSE, with an appropriate number of eigenvalues. 

This interlacing property between GOE and GSE had long been in the 
literature, and had in fact been noticed by Mehta and Dyson [22] • In this 
context, Forrester and Rains classified all weight functions for which 
alternate eigenvalues taken from an orthogonal ensemble form a correspond- 
ing symplectic ensemble, and similarly those for which alternate eigenvalues 
taken from a union of two orthogonal ensembles form an unitary ensemble. 
The following theorem gives explicit formulas for Di(s, A) and D^s, A); and 
hence, from (|3.11|) . a recursive procedure to determine Fi(-,m) and F^-^m) 
for m > 2. 

Theorem 3.3 (Dieng [10, llj). In the edge scaling limit, the distributions 
for the m th largest eigenvalues in the GOE and GSE satisfy the recurrence 
(13~TTJ) with 6 

- A - 1 - cosh/i(g, A) + y 7 ! sinh/i(g, A) 
Di{s, A) = D 2 (s, A) , (3.12) 
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D A (s, A) = D 2 (s, A) cosh 2 r^-j , (3.13) 

where 

/•oo 

[i(s,\):= q(x,X)dx, A:=2A-A 2 , (3.14) 

J s 

and q(x, A) is the solution to (13.8)1 such that q(x, A) ~ Va Ai(x) as x — > oo. 

Note the appearance of A in the arguments on the right hand side of ()3.12|) . 
In Fig. 2 we compare the densities f\(s, m), m — 1, . . . , 4, with finite iV GOE 
simulations. This last theorem also provides a new proof of the Baik-Rains 
interlacing theorem. 

Corollary 3.4 (Dieng fTUl fTT^. 

F 4 {s,m) = Fi(s,2m), m > 1. (3.15) 

The proofs of these theorems occupy the bulk of the remaining part of 
the chapter. In the last section, we present an efficient numerical scheme 
to compute Fp(s,m) and the associated density functions fp{s,m). We im- 
plemented this scheme using MATLAB™ , 8 and compared the results to 
simulated Wishart distributions. 



4 Preliminaries 

4.1 Determinant matters 

We gather in this short section more or less classical results for further ref- 
erence. 



Theorem 4.1. 



] [ {xj ~ x k ) 4 = det (x{ jx{ 1 )j=o,...,27v-i 

f .. r k=l,...,N 
0<j<k<N ' ' 



8 MATLAB™ is a registered trademark of The MathWorks, Inc., 3 Apple Hill Drive, 
Natick, MA 01760-2098; Phone: 508-647-7000; Fax: 508-647-7001. Copies of the code are 
available by contacting the first author. 
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-10 -8 -6 -4 -2 



2 4 



Figure 2: 10 4 realizations of 10 3 x 10 3 GOE matrices; the solid curves are, 
from right to left, the theoretical limiting densities for the first through fourth 
largest eigenvalue. 

Theorem 4.2. If A,B are Hilbert-Schmidt operators on a general 9 Hilbert 
space TC, then 



det (I + AB) = det(I + BA). 



Theorem 4.3 (de Bruijn, |7J). 




l<j,k<N 



d n(x\) ■ ■ ■ d h(xn) 



(4.1) 



N\ det 






sgn(x - y)(fj{x) ip k {x) d fi(x) d (i(y) 



l<j,k<N 



(4.2) 



See ^S] for proof. 
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detiipjixk) V ; j(^)) 1 < i < 2A r^At(^i) • ■■d/j l (x N ) 

(4.3) 



(2N)\ Pf ( / ^(x)^(x) - ip k {x)^{x) dfi(x) 

' l<j,k<2N 

where Pf denotes the Pfaffian. The last two integral identities were discovered 
by de Bruijn [7j in an attempt to generalize the first one. The first and last 
are valid in general measure spaces. In the second identity, the space needs to 
be ordered. In the last identity, the left hand side determinant is a 2 N x 2 iV 
determinant whose columns are alternating columns of the (pj and ipj (i.e. the 
first four columns are {(pj(xi)}, {V>j(£i)}, {(pj(x 2 )}, {ipj(x 2 )}, respectively 
for j — 1, ... ,2 N), hence the notation, and asymmetry in indexing. 

A large portion of the foundational theory of random matrices, in the 
case of invariant measures, can be developed from Theorems 14.21 and 14.31 as 
was demonstrated in [33J EH! • 

4.2 Recursion formula for the eigenvalue distributions 

With the joint density function defined as in (|3.1|) . let J denote the interval 
(t, 00), and X = Xj(x) its characteristic function. 10 We denote by X = 1 — X 
the characteristic function of the complement of J, and define X x = 1 — A X. 
Furthermore, let Ep^it^m) equal the probability that exactly the m largest 
eigenvalues of a matrix chosen at random from a (finite N) /3-ensemble lie 
in J. We also define 

G/3 lN {t, A) = J ' ■■■ J X A (xi) • ■■X\{x N ) Ppixi, . . .,x N )dx! ■■■dx N . (4.4) 

For A = 1 this is just Ep^(t, 0), the probability that no eigenvalues lie in 
(t, 00), or equivalently the probability that the largest eigenvalue is less than 
t. In fact we will see in the following propositions that Gp^(t, A) is in some 
sense a generating function for Ep^it^m). 

Proposition 4.4. 

G/3 tN (t,X) = J2(-X) k ( J / • • [p l3 (x 1 ,...,x N )dx 1 ---dx N . (4.5) 

A;=0 \ / J J 



Xi£J 



10 Much of what is said here is still valid if J is taken to be a finite union of open intervals 
in R (see [12]). However, since we will only be interested in edge eigenvalues we restrict 
ourselves to (£,00) from here on. 
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Proof. Using the definition of the X x (xi) and multiplying out the integrand 
of (jOj) gives 



Gg iN (t,X) = ^{-^) k J ■ ■ ■ J e k (X{xi),...,X{x N ))Pp(xi,...,x N )dxi---d 



Xn, 



where, in the notation of [00], e k = m X k is the k th elementary symmetric 
function. Indeed each term in the summation arises from picking k of the 
AX-terms, each of which comes with a negative sign, and N — k of the l's. 
This explains the coefficient (— X) k . Moreover, it follows that e k contains (^) 
terms. Now the integrand is symmetric under permutations of the X{. Also if 
Xi $l J, all corresponding terms in the symmetric function are 0, and they are 
1 otherwise. Therefore we can restrict the integration to x,i G J, remove the 
characteristic functions (hence the symmetric function), and introduce the 
binomial coefficient to account for the identical terms up to permutation. □ 



Proposition 4.5. 

E B>N {t,m) 



\m Jm 



m . 



d\ r 



G/3 t N(t, A) 



m > 0. 



(4.6) 



A=l 



Proof. This is proved by induction. As noted above, Eg^it.Qi) = Gp^{t, 1) 
so it holds for the degenerate case m = 0. When m = 1 we have 



~G B>N (t,X) 



A=l 



_d_ 

Jx 

N 

£ 

3=1 



(^l) " " * "X- \ (*^n) ^3 (*^0 ^ ' ' * d $ r 

X(x 1 )---X(x j - 1 )X(x j )X(x j+1 )--- 



A=l 



• ■ ■ X(xn) Pg (x) dxi ■ ■ -dxN- 

The integrand is symmetric under permutations so we can make all terms 
look the same. There are N = (f) of them so we get 



A=l 



1 

Ep,N(t, !)■ 



X{x\) X(x 2 ) ■ ■ -X(xn) Pg (x) dx\ ■ ■ -dxN 
X(x\) X(x 2 ) ■ ■ ■ X(xn) Pg N \x) dx\ ■ ■ ■ dxiy 



A=l 
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When to = 2 then 

1 ' 
2 



7a 



Gfi,N(t, A)| A=1 



AT 



AT 

E 

i=2 



X(xi)X(x 2 ) ■ • • X(xj-i) X(xj) X(xj+i) 



X(xn) Pr N \x) dxi ■ ■ ■ dxN 



A=l 



N(N -1) 



■ J X(xi) X(x 2 ) X(x 3 ) ■ ■ ■ X{xn) Pp N \x) dxi ■ • • dxN 



A=l 



X(xi) X(x 2 ) X(x 3 ) ■ ■ ■ X(xn) Pg N \x) dxi ■ ■ ■ dxN 



A=l 



= (J^J J ■■ J X{x 1 )X(x 2 )X{x 3 )---X(x N )Pf\x)dx 1 ---dx N 
= Ep jN (t, 2), 

where we used the previous case to get the first equality, and again the 
invariance of the integrand under symmetry to get the second equality. By 
induction then, 



1 

to ! 



d 

"d\ 



G/3,N(t, A)| A=1 



N (N — i) • • • (N — m + 2) 



m i 



N 

E 

j=m ■ 



X( Xl )X(x 2 ) 



■ X(xj-i) X(xj) X(x j+ i) ■ ■ ■ X(x N ) Pi N \x) dxi ■ ■ ■ dx N 



N(N -1)---(N -m + 1) 



m i 



P 

X( Xl ) 



• X(x m ) X(x m+ i) ■ ■ ■ X(x N ) P ( N) (x)dxi---dx N 



TO, 

E P:N (t,m). 



X(xi) ■ ■ -X(x m ) X(x m+ i) ■ ■ - X(x N ) P^\x)dxi ■■■dx N 



□ 



If we define Fp jN (t,m) to be the distribution of the m th largest eigen- 
value in the (finite N) /3-ensemble, then the following probabilistic result is 
immediate from our definition of Ep^ N {t,m). 
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Corollary 4.6. 

Fp }N (t, m + 1) - Ff) jN (t, m) = E^ N (t, m) 



(4.7) 



5 The Distribution of the m th Largest Eigen- 
value in the GUE 

5.1 The distribution function as a Predholm determi- 
nant 

We follow [SI] for the derivations that follow. The GUE case corresponds to 
the specialization (3 = 2 in (|3.1|) so that 



G 2 ,N(t, A) = Cf ] J ■■■J Y\_( X J ~ x kf W w { x j) W (1 + /(%)) dxi---dx N 

(5-1) 

where w{x) = exp (— x 2 ), f(x) = —\Xj(x), and C% depends only on N. 
In the steps that follow, additional constants depending solely on N (such 
as N\) which appear will be lumped into C%- A probability argument will 
show that the resulting constant at the end of all calculations simply equals 
1. Expressing the Vandermonde determinant 



1 I (Xj - x k ) = det (x{) j=o,...,N (5.2) 

i<r<i<N k=i - N 

and using (|4.1|) with (pj(x) = ipj(x) = x^ and dfi(x) = w(x)(l + f{x)) yields 
G 2 , N (t,X) = C { 2 N) det ( [ x j+k w(x)(l + f(x)) dx)) . (5.3) 

\Jm. / j,k=Q,...,N-l 

Let {(fij(x)} be the sequence obtained by orthonormalizing the sequence 
[x j w 1 ^ 2 (x)}. It follows that 

G 2 ,*(*,A) = cf° det (f ifiix) Mx) (l + f(x))dx)) (5.4) 

\Jm. J j,k=0,...,N-l 

= det f S jtk + / fj{x) (pk{x) f(x)dx) J (5.5) 

V Jn J j,k=a,...,N-i 
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The last expression is of the form det(I+AB) for A : L 2 (M.) — > with kernel 
A(j,x) = (fj(x)f(x) whereas B : C N — > L 2 (M.) with kernel B(x,j) = fj(x). 
Note that AB : -> C N has kernel 

AB(j,k)= / (pj(x)(p k (x) f(x)dx (5.6) 
Jm. 

whereas BA : L 2 (M) -> L 2 (M) has kernel 

N-l 

BA(x, y) = ^2 Vk( x ) := K ^n(x, y). (5.7) 

k=0 

From Theorem 14.21 it follows that 

G 2>N {t, A) = cf° det (I + tf 2i * /) , (5.8) 

where K2,jv has kernel K 2) N(x,y) and K 2>N f acts on a function by first 
multiplying it by / and acting on the product with K 2j n- From ()5.1|) we 
see that setting / = in the last identity yields C% = 1- Thus the above 
simplifies to 

G 2>N (t, \) = det (I + K 2>N f). (5.9) 



5.2 Edge scaling and differential equations 

We specialize w(x) = exp (— x 2 ), f(x) = —\Xj(x), so that the {<fj(x)} are 
in fact the Hermite polynomials times the square root of the weight. Using 
the Plancherel-Rotach asymptotics of Hermite polynomials, it follows that 
in the edge scaling limit, 

lim ——- — -—- Kj\ 1 ' " ' ' 



(5.10) 

is i^Ai(^, 2/) as defined in ()2.1|) . As operators, the convergence is in trace class 
norm to K 2 . (A proof of this last fact can be found in [26J.) For notational 
convenience, we denote the corresponding operator K 2 by K in the rest of 
this subsection. It is convenient to view K as the integral operator on M. with 
kernel 

K(x, y) = Xj{y), (5.11) 

x-y 
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where ip(x) = y/\ Ai(x), tp(x) = VXAi'(x) and J is (s, oo) with 

t = v / 2iV+ ^ S ; . (5.12) 

Note that although K(x,y), ip and ip are functions of A as well, this depen- 
dence will not affect our calculations in what follows. Thus we omit it to 
avoid cumbersome notation. The Airy equation implies that •p and ip satisfy 
the relations 

d 

~r<p = v>, 

ax 

-^—ib = xip. (5.13) 
ax 

We define D 2 ,n{s, A) to be the Fredholm determinant det (I — K Nj2 )- Thus 
in the edge scaling limit 

lim D 2> n(s, A) = D 2 (s, A). 

We define the operator 

R=(I-K)- 1 K, (5.14) 

whose kernel we denote R(x,y). Incidentally, we shall use the notation = in 
reference to an operator to mean "has kernel". For example R = R(x,y). 
We also let M stand for the operator whose action is multiplication by x. It 
is well known that 

4- log det (J - K) = -R(s,s). (5.15) 
as 

For functions / and g, we write / <g) g to denote the operator specified by 

f®g = f(x)g(y), (5.16) 

and define 

Q(x,s) = Q(x) = ((I-K)- 1 ip)(x), (5.17) 
P(x,s) = P(x) = ((I — K)~ l ip) (x). (5.18) 

Then straightforward computation yields the following facts 

[M,K] = ip®ip-il)®ip, 
[M ,(J - KY 1 ] = (I - K)~ 1 [M ,K](I - K)' 1 

= Q®P-P®Q. (5.19) 
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On the other hand if (I — K) 1 = p(x, y), then 

p(x,y) = 8(x-y)+R(x,y), (5.20) 

and it follows that 

[ M , (I - K)- 1 ] = (x - y)p(x, y) = (x- y)R(x, y). (5.21) 
Equating the two representation for the kernel of [M , (I — Ky 1 ] yields 

R(x , y) = Q(*) p (v)-P(*)Q(y) (5 22) 

x-y 

Taking the limit y — > x and defining q(s) = Q(s, s), p(s) = P(s, s), we obtain 

R(s, s) = Q'(s, s) p(s) - P'{s, s) q(s). (5.23) 

Let us now derive expressions for Q'(x) and P'(x). If we let the operator D 
stand for differentiation with respect to x, 

Q'(x,s) = D(I — Ky 1 if 

= (I — K)^ 1 Dip + [D, (I — K)^ 1 ] if 

= {I-Ky l ^+[D,{I-Ky l ]f 

= P(x)+[D,(I-Ky 1 ]<p. (5.24) 

We need the commutator 

[D, (/ - Ky 1 ] = (/ - K)' 1 [D, K] (/ - Ky 1 . (5.25) 

Integration by parts shows 

[D, K]=(^ + ^j+ K(x, s)S(y - s). (5.26) 

The 5 function comes from differentiating the characteristic function x- More- 
over, 

OK 8K\ . . . . . _ 

+ — = <p(x)<p(y). (5.27) 



dx dy 
Thus 

[D, (I - Ky 1 ] = -Q{x)Q{y) + R(x, s)p(s, y). (5.28) 
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(Recall (I — K) 1 = p(x,y).) We now use this in (|5.24j) to obtain 

Q'(x,s) = P(x)-Q(x)(Q,(p)+R(x,8)q(s) 
= P(x) — Q(x)u(s) + R(x, s)q(s), 

where the inner product (Q, ip) is denoted by u(s). Evaluating at x = s gives 

Q'{s, s) = p{s) - q{s)u{s) + R{s, s)q(s). (5.29) 
We now apply the same procedure to compute P' . 

P'(x,s) = D(I-K)' 1 ^ 

= (I — K)" 1 Dip + [D, (I — Ky 1 ] ip 

= M(I- KY 1 <p+[(I- Ky 1 ,M]<p + [D, (I - Ky 1 ] ip 

= xQ(x) + {P®Q-Q®P)p + (-Q ® Q)ip + R(x, s)p(s) 

= xQ{x) + P{x) (Q, ip) - Q(x) (P, if) - Q{x) (Q, ip) + R(x, s)p{s) 

= xQ(x) - 2Q(x)v(s) + P(x)u(s) + R(x, s)p(s). 

Here v = (P, <p) = (ip, Q). Setting x = s we obtain 

P'(s, s) = sq{s) + 2q(s)v(s) + p{s)u{s) + R{s, s)p{s). (5.30) 

Using this and the expression for Q'(s, s) in (|5.23j) gives 

R{s, s)=p 2 - sq 2 + 2q 2 v - 2pqu. (5.31) 

Using the chain rule, we have 

<h ' ( ° +#W,*)U- (5.32) 



ds \dx ds t 
The first term is known. The partial with respect to s is 
dQ(x,s) xdK ! 

= -R(x,s)q(s), 



where we used the fact that 

dK 



= -K(x,s)5(y-s). (5.33) 



ds 
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Adding the two partial derivatives and evaluating at x = s gives 



-r— = p — git. (5.34) 
a s 

A similar calculation gives 

-r^ = sq — 2qv + pu. (5.35) 
a s 

We derive first order differential equations for u and v by differentiating the 
inner products. Recall that 



u(s) = / ip(x)Q(x, s) dx. 



Thus 



du [°° dQ(x,s) 

-Lp{s)q{s) + / (p{x) dx 



ds J s ds 

' POO 

(f(s) + / R(s,x)(p(x) dx \ q(s) 



-Q 



(I - K)- 1 ip{s) q(s) 

2 



Similarly, 



= -PQ- ( 5 - 36 ) 

a s 

From the first order differential equations for q, u and v it follows immediately 
that the derivative of u 2 — 2v — q 2 is zero. Examining the behavior near s = oo 
to check that the constant of integration is zero then gives 

u 2 - 2v = q 2 . (5.37) 

We now differentiate (J5.34j) with respect to s, use the first order differential 
equations for p and u, and then the first integral to deduce that q satisfies 
the Painleve II equation (|3.8|) . Checking the asymptotics of the Fredholm 
determinant det(7 — K) for large s shows we want the solution with boundary 
condition 

q{s,X) ~ V / AAi(s) as s — > oo. (5.38) 
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That a solution q exists and is unique follows from the representation of the 
Fredholm determinant in terms of it. Independent proofs of this, as well as 
the asymptotics as s — > oo were given by [TH], 0,0- Since [D , (I — K)^ 1 ] = 
(d/dx + d/dy) R(x,y), (E^KJ) savs 

+ — )#(*,</) = -Q(ar)Q(3/) + R(x,s)p(s,y). (5.39) 



<9x cfr/ 

In computing dQ(x, s)/ds we showed that 



^ (J - K)" 1 = ^i?(x, y) = -i2(x, s)p(s, y). (5.40) 



Adding these two expressions, 



4 9a; <9y 9s 
and then evaluating at x — y — s gives 

d 



' ) + 1- + 1- ) #0, y) = -Q(?) Q(y)i ( 5 - 41 ) 



, R(s,s) = -q 2 . (5.42) 

a s 

Integration (and recalling (|5.15|l ) gives, 

— logdet(I-it) =- / q 2 (x,X)dx; (5.43) 
ds J s 



and hence, 



/OO / POO \ />c 

(j g 2 (a;,A)da;J dy = -J 



(x — s)q 2 (x, A) fix. 



(5.44) 

To summarize, we have shown that D2(s, A) has the Painleve representation 
(j3.10|) where q satisfies the Painleve II equation ()3.8|) subject to the boundary 
condition (15.3 
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6 The Distribution of the m th Largest Eigen- 
value in the GSE 

6.1 The distribution function as a Predholm determi- 
nant 

The GSE corresponds case corresponds to the specialization /3 = 4 in (jH.lj) 
so that 



r r N N 

G 4tN {t,X) = Cf' / • • • / JJ (xj - x k f JJw(£j) JJ(1 + f{xj)) dxi---d: 



N N 

■ ■ ■ (I J' v 

j<k j j 

(6-1) 

where w(x) = exp (— x 2 ), f(x) = —A Xj(x), and depends only on N. As 
in the GUE case, we will absorb into any constants depending only on 
N that appear in the derivation. A simple argument at the end will show that 
the final constant is 1. These calculations follow By (|4.1|) . G^it, A) is 
given by the integral 



r r N N 

C{ N) J ■■■ J det(x J k j a^T 1 ) 3=0,^,2 n-i JJ_w(x i )JJ(l+f(x i ))dxi---dx_x 



i=l i=l 



which, if we define <Pj(x) = x^ 1 w(x) (1 + f(x)) and ipj(x) = (j — 1) x^ 2 and 
use the linearity of the determinant, becomes 

G 4 ,iv(t, A) = C[ N) • • / det((pj(x k ) ip(x k ))i<j< 2 N dx x ■ ■ ■ dx N . 

J J Kk<N 



Now using ([4.3)1 . we obtain 
G AyN (t,\) = C[ N > Pf ( / (pj(x)ip k x - (p k (x)ipj(x) dx 



l<j,fc<2 JV 

'W Pf I I fir - ,'Wi+ fc -3, 



Q JVj Pf f y (As — j) x J+fe - 3 w(x) (1 + f(x))dx 
cf° Pf M (A;-j)^' +fe - 1 w;(x)(l + /(x))c/ 



l<j,fc<2 JV 
0<j,fc<2 JV-1 
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where we let k — > k + 1 and j ; — > j ; + 1 in the last line. Remembering that 
the square of a Pfaffian is a determinant, we obtain 



G'l N (t,X) =C[ N) det ( / (k-j)x J+ «- L w(x)(l + f(x))dx 

0<j,k<2 JV-1 

Row operations on the matrix do not change the determinant, so we can 
replace {x 3 } by an arbitrary sequence {pj(x)} of polynomials of degree j 
obtained by adding rows to each other. Note that the general (j, k) element 
in the matrix can be written as 



d 

dx 



x k I x 3 - 



d 

dx 



x 3 ) x k 



w(x) (! + /(*)). 



Thus when we add rows to each other the polynomials we obtain will have 
the same general form (the derivatives factor). Therefore we can assume 
without loss of generality that G\ N {t, A) equals 

C[ N) det ( j [ Pj (x)p' k (x) - v' 3 (x)Vk(x)] w(x) (1 + /(*)) dx) 

' 0<j,k<2 N-l 

where the sequence {pj{x)} of polynomials of degree j is arbitrary. Let 
ipj = Pj w x l 2 so that pj = ipj w~ l l 2 . Substituting this into the above formula 
and simplifying, we obtain 



G'i N (t,X) = Cf°det J ((^(x)^ k (x)- ^ k (x)^(x))(l + f(x))) dx 

= C[ N) det [M + L] = Cf° det[M] det [7 + M' 1 ■ L], 
where M, L are matrices given by 



0<j,fc<2 JV-1 



M 



(ipjix) ip' k (x) - tp k (x) ipjix)) dx 



0<j,k<2 JV-1 



(ipj(x) ip k (x) - ip k (x) ipjix)) f(x)dx 

' 0<j,k<2 JV-1 

Note that det[M] is a constant which depends only on N so we can absorb 
it into Cf°. Also if we denote 

2JV-1 
k=0 
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it follows that 



M- 1 -N=U faWi/iix) - r/^Mx)) f(x)dx 

^ J ' 0<j,fe<2 JV-1 

Let A : L 2 (R) x L 2 (R) -> C 2Ar be the operator defined by the 2iV x 2 matrix 



/ Vo(x) -rj'o(x) \ 



J 



Thus if 



we have 



e L (M) x L 



Ag = A{x)g 



( J (Vo9o ~ Vo9i) dx\ 
J (Vi9o ~ V'i9i) dx 



\ 



e C 



2N 



J 



Similarly we define B : C 2n -> L 2 (R) x L 2 (1R) given by the 2 x 2n matrix 



B(x) = f 



ip' (x) ip[(x) 
tpo(x) Vi(^) 



Explicitly if 



a 



then 



Ba = B(x) ■ a 



( «o \ 



eC 2N , 



/ 2N-1 \ 



i=0 
2 JV-1 



e L 2 x L 2 . 



i=0 
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Observe that M~ l ■ L = AB : 



<<2n 



C 2n . Indeed 



/ 



ABa 



2N-1 



E 

i=0 
2JV-1 

E 

i=0 L 



J 



{vrfk ~ Vftk) fdx 



(M -1 • L) a. 



Therefore, by (|4.2j) 

G^(t, A) = cf° det(J + M" 1 ■ L) = cf° det(J + AB) = cf° det(J + BA) 
where BA : L 2 (R) — > L 2 (R). From our definition of A and i? it follows that 

/ 2n-l / « \ \ 



5^ 



/S^* ( / (^9o-v'i9i) dx 

i=0 ^ 
2N-1 , 

^zl^\J ^ i9 ° ~ rfiSi) dx 



i=0 

l n 2N- J 



/ 



./ 



V 



2N-1 



Y ^'i( x )Vi(y)9o(y)dy - / ^ $(aO^(y)0i(i/)dj/ 

1=0 ^ 8=0 

!JV-1 -2N-1 

^2 ijj i (x)r] i (y)g (y)dy- / ^ ipi( x Wi(v)9i(y) d V 



i=0 



i=0 



= fK AjN g, 

where jv is the integral operator with matrix kernel 



/ 2JV-1 2/V-l \ 

Y ^'i{x)r]i{y) ~ Y ^i(xK(y) 



i=0 
2N-1 



j=0 
2/V-l 



XI Mrfviiv) - Y ^( x H(y) 



i=0 



i=0 



/ 
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2JV-1 



Recall that rjj(x) = ^j k ipk{x) so that 



fc=0 



/ 2JV-1 



2N-1 



j,k=0 
2N-1 



j,k=0 
2N-1 



j,k=0 



\ j,k=0 

Define e to be the following integral operator 



/ 



e = e{x- y) 



\ if x > y, 
— \ if x < y. 



(6.2) 



As before, let D denote the operator that acts by differentiation with respect 
to x. The fundamental theorem of calculus implies that D e = eD = I. 
We also define 

2N-1 

S N (x,y) = Y i , 'j( x )^iki'k{y)- 

j,k=0 



Since M is antisymmetric, 

2N-1 

S N (y,x) = Y ipj(y)vjkipki 

j,k=0 

after re-indexing. Note that 

2N-1 



x 



2N-1 2N-1 

Y ^j(y)vkj^k{ x ) = - Y ^Ay)^Mx), 

j,k=0 j,k=0 



2N-1 



eS N (x,y) = Y ^ D ^j(x)/i jk ^k(y) = Y ^A x )^ik^k{y), 

j,k=0 j,k=0 



and 



2N-1 



—j-S N (x,y) = Y ^j( x )^jk^k(y)- 
Thus we can now write succinctly 



K N (x,y) 



S N (x,y) -^S N (x,y) 
eS N (x,y) S N (y,x) 



(6.3) 
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To summarize, we have shown that G\ N (t, A) = C 4 det(J— K^f). Setting 
/ = on both sides (where the original definition of G^(t, A) as an integral 
is used on the left) shows that Cj = 1. Thus 



G iiN (t,X) = y/D 4iN (t,X), (6.4) 

where we define 

D 4>N {t,X)=det{I + K 4)N f), (6.5) 
and K 4j jsr is the integral operator with matrix kernel (|6.3j) . 

6.2 Gaussian specialization 

We would like to specialize the above results to the case of a Gaussian weight 
function 

w(x) = exp (x 2 ) (6.6) 

and indicator function 

f(x) = -XX J , J=(t,oo). 

We want the matrix 

M ={J (^-(x)^(x) - ij) k (x) ipj(x)) dx 

to be the direct sum of N copies of 

so that the formulas are the simplest possible, since then fijk can only be 
or ±1. In that case M would be skew-symmetric so that M _1 = —M. In 
terms of the integrals defining the entries of M this means that we would 
like to have 

ip2j(x) ip2k+i(x) - 4>2k+i(x) j^4>2j(x)^ dx = 8 jtk , 

1p 2 j+l(x) 3— 1p2k{x) - 4>2k{x) 3— 1p2j+l(x) J dx = -6 jsk 

dx dx ) 
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0<j,fc<2 N-l 



and otherwise 

^A x )^-^k{x) - ipj(x) ip k (x) J dx = 0. 
ax dx J 

It is easier to treat this last case if we replace it with three non-exclusive 
conditions 

tp2j{x) -j^^2k{x) - tp2k(x) ^~^ 2 i( x )^) dx = °> 

^2j+l(x) -^-iJ 2 k+l(x) - 4>2k+l(x) dx = 0, 

(so when the parity is the same for j, k, which takes care of diagonal entries, 
among others) and 

T~~ ^k(x) - ipj{x) — ipk{x)) dx = 0, 
dx dx J 

whenever \j — k\ > 1, which targets entries outside of the tridiagonal. Define 

1 



fk{x) = — H k (x) exp(— x 2 /2) for c k = \j2 k k\\fn (6.7) 

where the H k are the usual Hermite polynomials defined by the orthogonality 
condition 



/ Hj(x) H k (x) e x2 dx = c 2 j 8 jtk . 
Jr. 



Then it follows that 

/ ifj(x) ip k (x)dx = 8 jjk . 
Jr 

Now let 

^2J+l ( X ) = ~J= f 2 J+l ( X ) ^2J ^ = ~^ € ^ 

This definition satisfies our earlier requirement that ipj = pj w 1 ^ 2 with w 
defined in (16.6)1 . In particular we have in this case 

P2j+i(x) = —=H 2j+1 (x). 
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Let e as in (|6.2j) . and D denote the operator that acts by differentiation with 
respect to x as before, so that D e = e D = I. It follows that 



d 



d 



% ( x ) J~ x VWi ( x ) - VWi ( x ) j- x % ( x ) 



dx 



" 6 ^2 J+ i ( X ) -T- ¥ 2k+1 (x) + p 2k+1 (x) — € p 2j+1 [X) 



d 

dx 
d 

dx 



d 

dx 



dx 



- e ^2 J+ i (x) -T- V 2k+1 (x) + ¥ 2k+1 (x) <f 2j+1 (x) 



dx 



We integrate the first term by parts and use the fact that 

■^e<p. (x) = i Pj (x) 
and also that </?. vanishes at the boundary (i.e. if. (±00) = 0) to obtain 



d 



d 



^ (x) J~x ^ 2k+i (x) ~ ^ 2k+i ^ 'die ^ (x) 



dx 



d 



- e ^ 2]+ i (x) \ x ) + ^ 2fc+ i w Wi w 



dx 



\ I [<P V+ 1 ( X ) f2k+l ( X ) + P2k+l ( X ) ( X )} dx 

[p 2j+1 (x) f 2k+1 (x) + ip 2k+1 (x) <p 2j+1 (x)] dx 
(S jtk + S j)k ) 



1 

2 
1 

2 



as desired. Similarly 



' ' 1 1 ( C) Ji ^ 2h (X) ~ ^ 2k ^ Ji ^ 2J+1 (X) 



2j + l 
1 

2 



dx 



d 



+ 1 i X ) J^ e ^2k + 1 ( X ) +^2 k+ l ( X ) <P2 j+ l (•'•» 



dx 



Moreover, 



X 



CjV2 



H 2j+1 (x) 
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is certainly an odd function, being the multiple of and odd Hermite poly- 
nomial. On the other hand, one easily checks that e maps odd functions to 
even functions on L 2 (IR). Therefore 



p 2j (x) 



e H 2j+1 (x) 



is an even function, and it follows that 



/ 

Jr 



d 



d 



(x) ip 2j (x) - ip 2j (x) — ip 2k (x) 



dx 



P2j(x)-^p 2k (x) -P2k(x)^p 2 j(x) 



w(x) dx 



0, 



since both terms in the integrand are odd functions, and the weight function 
is even. Similarly, 



dx 



P2j+l (x) P2k+1 (x) ~ P2k+1 (X) P2j+1 (x) 

dx dx 



w(x) dx 



0. 



Finally it is easy to see that if \j — k\ > 1 then 



/ 



^ (X) ^ k (X) ~ ^ ^ Ix~ ^ k ^ 



dx = 0. 



Indeed both differentiation and the action of e can only "shift" the indices 
by I. Thus by orthogonality of the (pj, this integral will always be 0. Hence 
by choosing 

(x) = ^= <P 2j+1 (x) , i> 2i (x) = --j=e <f 2j+1 (x) , 
we force the matrix 



M 



(ipj(x) ip' k (x) - ijj k (x) ip'j(x)) dx 



0<i,fe<2 Ti-l 
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to be the direct sum of N copies of 

Z = 



1 
-1 



Hence M~ l = -M where M -1 = {nj, k }j,k =o,2JV-i- Moreover, with our above 
choice, fj,j t k = if j,k have the same parity or \j — k\ > 1, and H2k,2j+i = 
Sjk = —(i2j+i,2k for j, k = 0. . . . , N — 1. Therefore 

27V- 1 



S N (x,y) = -^2 ip , j(x)fij k il>k(y) 

j,k=0 
N-l 

= - ^-0) ^2j+i(y) + J] ^h-i^) My) 



N-l 



3=0 
N-l 



3=0 



N-l 



.3=0 3=0 



Recall that the Hj satisfy the differentiation formulas (see for example [2], 
p. 280) 

H' j (x) = 2xH j (x)-H j „ l {x) j = 1,2,... (6.8) 



H' j {x) = 2jH j _ 1 {x) j = l,2,.. 
Combining ()6.7|) and ()6.8|) yields 

^■(x) = x<pj(x) - ^<p j+1 (x). 



Similarly, from ()6.7|) and ()6.9|) we have 



c .i-i 



M x ) = -x(pj(x) + 2j (pj-i(x) 



(6.9) 



(6.10) 



(6.11) 



Combining (|6.10|) and (JHHTJ) , we obtain 



J -^j-i(x) - J Lt- (p j+1 ( x ) 



(6.12) 
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\ 




I 


A 


\ 


Let ip = 






and if 1 = 




A 2 








J 




\ 




) 



. Then we can rewrite (16.12)1 as 



ip' = A ip 



where A = {aj^} is the infinite antisymmetric tridiagonal matrix with a 3 - 



. Hence, 



Aj( x ) = y^^jkVkjx)- 

k>0 

Moreover, using the fact that De = eD = Iwe also have 
ipj (x) = etp'j (x) = e I Y a jk ip k (x) 

\k>0 J 

= Y a i k€ipk ( x ) ■ 

k>0 

Combining the above results, we have 

N-l N-l 

Y Vhj+i ( x ) e< ^2i+i (v) = YY ^j+hkVk (x)e<p 2 j+i (x) 

j=0 j=0 k>0 

N-l 

= ~YY a k,2j+i^k {x)eip 2j+ i (x) . 

j=0 k>0 

Note that a^j+i = unless \k — (2j + 1)| = 1, that is unless k is even. Thus 
we can rewrite the sum as 

N-l 

Y Aj+i (x)eif 2j+ i (y) = - Y a kJfk (x)eipj (y) - a 2N2N+1 ip 2N (x)e<p 2N+1 (y) 

3=0 k,j>0 

k even 
k<2N 

= ~ Y 0) Y V <Pj (V) + a 2N,2N+l V2N ( X ) 6 (y) 

j>0 



k>0 
k even 
k<2N 



where the last term takes care of the fact that we are counting an extra term 
in the sum that was not present before. The sum over j on the right is just 
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(p k (y), and a 2N 2N+1 = -J^r 1 - Therefore 



N-l 



3=0 



k>0 
k even 
k<2N 

N 



Y (x) <p 2j (y) 
3=0 



2N+1 



2N+1 



V 2N (x)ep 2N+1 (y) 



¥ 2N W^ 2W (y)- 



It follows that 



Sn(x,v) = - 



2N 



.3=0 



<Pj ( x ) v s (y) - \l 2iV + 1 (?) e (y) 



We redefine 



2N 



s N (x, y ) = Y l P n 0) ¥n (y) = s N(y, x) 



(6.13) 



n=0 



so that the top left entry of K]sr(x,y) is 



Sn{x, y) + \j — 2 — ^ x > e ( y ) ■ 

If Sn is the operator with kernel Sn{%,v) then integration by parts gives 



S N Df = I S(x,y)-£-f(y)dy 



d 

dy 



Sn(x,v) ) f(y)dy, 



so that —j^Sjsf(x,y) is in fact the kernel of SjyD. Therefore (|6.4|) now holds 
with K^n being the integral operator with matrix kernel K 4 ^(x,y) whose 
(i, j)-entry KH\x,y) is given by 



k a,n ( x ,y) 
K 4,N( x ,y) 

K ?N(x,y) 



SN(x,y) + 
SD N (x,y) 
S N (x,y) + 
Sn(x,v) + 



2N + 1 



¥>2N ( X ) €l P2N+l (y) 



d 

dy 



2N + 1 



v 2 N ( x ) e ¥>2N+i (y) 



2N + 1 



2N+1 



¥>2N ( x ) ei f2N+i (y) 



e ¥W ( x ) V2N (v) 
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We let 2N + 1 — > N so that iV is assumed to be odd from now on (this will 
not matter in the end since we will take N -> oo). Therefore the Kf$(x,y) 
are given by 



where 



Define 



so that 



K 



if 



& 2) (^Z/) = 2 



4,JV v-7 a, 2 



Sjv(a:, v) + y~2 ^-i ^iv (?/) 

/jv 

SD N (x,y) - \J — ip N _Ax) (p N (y) 
[W 

s n (x, y) + y~2 Vn-i ( x ) e v N (y) 

W 



S N (x,y) + \l — eip N (x) if N _ 1 (y) 



N-l 



S N (x,y) = ^2<p n (x)<p n (y). 



n=0 



1/4 



1/4 



<p( x ) = ( y ) ^0*0 ^0*0 = ( — J ^-iW. 



= \x(x) [SD N {x,y)-i/>(x)<p(y)] X(y), 
K$\x,y) = \x{x) [eS N (x,y) + ei>(x)e<p(y)} X(y), 
K%${x,y) = \x{x) [S N (x,y) + ecp(x)i;(y)} X(y). 



Notice that 



-X (S + i/>®e<p)X±Kffi(x,y), 
\x {SD-i,® ip) X = K^\x,y), 



X (eS + e*P® eip) X = K$\x, y), 



-X (S + e V ®e^)X = K^\x,y). 
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Therefore 

K iN = \x( S +Jfff S Q D -*»f)x. (6.14) 
4,7V 2 \eS + eip® e<p S + eip^ip J v ; 

Note that this is identical to the corresponding operator for (3 = 4 obtained 
by Tracy and Widom in [3JJ|> the only difference being that (p, ip, and hence 
also S, are redefined to depend on A. This will affect boundary conditions 
for the differential equations we will obtain later. 

6.3 Edge scaling 

6.3.1 Reduction of the determinant 

We want to compute the Fredholm determinant (|6.4|) with given by 

(I6.14j) and / = X(t,oo)- This is the determinant of an operator on L 2 (J) x L 2 (J). 
Our first task will be to rewrite the determinant as that of an operator on 
L 2 (J). This part follows exactly the proof in [33]. To begin, note that 

[S,D] = + (6.15) 

so that, using the fact that D e — eD — I, 

[e,S] = eS-Se 

= eSDe-eDSe = e [S,D] e 

= e <p ® ip e + e 4> ® <p e 
= e(f ® e V + eip <g> £ ip 

= —tip^eip — eip^eip, (6.16) 

where the last equality follows from the fact that e* = — e. We thus have 

D (e S + etp Cg) e<p) = S + tp®€{p, 
D (eSD-eij®(p) = SD-if>®<p. 

The expressions on the right side are the top matrix entries in ()6.14|) . Thus 
the first row of iCt,jv is, as a vector, 

D (eS + eip <g> eip,eS D - eip <g> ip) . 

Now ()6.16|) implies that 

eS + etp®6(p = Se — eip ® eip. 
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Similarly (|6.15|) gives 

e [S ,D] = ep <g> tp + eip ® p, 

so that 

eSD-ei)®p = eDS + ep®ip = S + ep(&il). 
Using these expressions we can rewrite the first row of K^n as 

D (S e — ep®eip,S + ep®ip). 

Now use f!6.16|) to show the second row of K±n is 

(S e — ep®eip ) S + ep®il)). 

Therefore, 

K 4 = x ( D (St-^®^) D(S + ep®ii)\ 
\ Se — ep®eip S + ep®ip J 

( XD \ / (Se-ep®eip) X (S + ep®tp)X 
\ X J \ {Se - ep>®eip)X (S + eip®ip)X 

Since K 4N is of the form A B, we can use 14.21 and deduce that D 4N (s, A) is 
unchanged if instead we take K 4 N to be 



K, 



4,V 



(Se-ep®e%p) X {S + e <p ® if>) X \ ( X D 
(Se - e<p <g> eip) X (S + ep®ij)X J \ X 

(Se - ep <g> eip) XD (S + tp®ip)X 
(Se-etp®eip)XD (S + ep®if})X 



Therefore 



n f .x _ , , ( I- \ (Se-ep®eip) XXD -\ (S + e p ® ip) X X \ 
^M^j-det^ _^ (5 f _ e ^ e ^) AXD I -\(S + ep®ip) XX J 

(6.17) 

Now we perform row and column operations on the matrix to simplify it, 
which do not change the Fredholm determinant. Justification of these oper- 
ations is given in [33] ■ We start by subtracting row 1 from row 2 to get 

r - \ (Se-ep>®eip) XXD -\ (S + ep ® ip) XX 
-I ' I 
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Next, adding column 2 to column 1 yields 



/ l-\ (Se- e(f(g)6ip) XXD-\ (S + ecp®^) XX -|(5 + e^®^)AX 

Thus the determinant we want equals the determinant of 

I -- (Se-e V ®eij) \ XD -- (S + e<^®^) XX (6.18) 

So we have reduced the problem from the computation of the Fredholm 
determinant of an operator on L 2 (J) x L 2 (J), to that of an operator on 
L\J). 

6.3.2 Differential equations 

Next we want to write the operator in ()6.18|) in the form 

L 



(/ - K 2)N ) ( / - J2 ® A J , (6. 19) 

where the «j and /3j are functions in L 2 (J). In other words, we want to rewrite 
the determinant for the GSE CcLSG clS Sb finite dimensional perturbation of the 
corresponding GUE determinant. The Fredholm determinant of the product 
is then the product of the determinants. The limiting form for the GUE part 
is already known, and we can just focus on finding a limiting form for the 
determinant of the finite dimensional piece. It is here that the proof must 
be modified from that in A little rearrangement of ()6.18|) yields (recall 
e* = -e) 

I-^SX-^SeXD-^etp ® X^-^ey? <8> ipeXD. 
Writing e [ X , D ] + X for eX D and simplifying gives 

I - XSX - Xetp <g> i)X-^Se [X,D]- ^ey? ® 4) e [X,D]. 
Let a/A ip — >■ </?, and \/A V ~~ *■ ^ so that A S — > S and (j6.18|) goes to 
I - SX-e<p ® tpX- -Se [X,D]- ~e<p ® ipe[X,D). 



41 



Now we define R :— (I — S X)^ 1 S X = (I — S X)^ 1 — I (the resolvent operator 
of Sx), whose kernel we denote by R(x,y), and Q e := (/ — SX)^ 1 e<p. Then 
f!6.18|) factors into 

A= (I-SX)B. 

where B is 

I-Q e ® X^-^(I + R)Se [X,D] - ~ (Q e ®1>)e[X,D] 

Hence 

D itN (t,X) = D 2tN (t,X) det(B). 
In order to find det(B) we use the identity 

2m 

e [X,D] = ^(-l) fc e fc ®4, (6.20) 
k=i 

where e k and 5 k are the functions e(x — a^) and 5(x — a^) respectively, 
and the a k are the endpoints of the (disjoint) intervals considered, J = 
U^L 1 (a 2 fc-i, a 2 fc). In our case m = 1 and a% = t, a 2 = oo. We also make use 
of the fact that 

a®b-c®d=(b,c)-a®d (6.21) 
where ( . , . ) is the usual L 2 -inner product. Therefore 

2 

(Q e ®^)e [X,D] = ^{-l) k Q e ®i)-t k ®5 k 

k=l 
2 

= J](-l) fe (^,e fc ) Q t ® 5 k . 
k=i 

DA N (t,X) 



k=l 

It follows that 



D 2>N (t,X) 

is the determinant of 



(6.22) 



1 2 

I-Q e ®Xi/j-- ^(-lf|(S + J R5)e fc + (f e fc ) Q e \ ® 5 k . (6.23) 



2 

k=l 
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We now specialize to the case of one interval J = (t, oo), so m = 1, ai = t 
and a 2 = oo. We write e t = ei, and = e 2 , and similarly for 5*.. Writing 
out the terms in the summation and using the fact that 



1 

2' 



(6.24) 



yields 



I-Q e ®XV>+~ [(5 + /^5)e t +(f e ( ) Qe]®S t +~ [(3 + RS) 1 + ( ^ , 1 ) Q £ ]®5 C 

(6.25) 

Now we can use the formula 



det f J - ^ a, <g> $ J = det (<5, 



fc - (a,- ,/3 fc )) 



l<j,k<L 



(6.26) 



In order to simplify the notation in preparation for the computation of the 
various inner products, define 

Q(x,X,t) := (I-S'X)-V, P(x,A,t) := 

Q e (a:,A,*) := (I-SX^eip, P e (x, X,t) := (I - S X)' 1 eip, (6.27) 



9 N 
ft 



Q(t,\,t), 

Qe{t,X,t), 

(Q,Xe<f) 
{P,Xeip) 



Qe,X(f 
Qe,Xip 



Pn 
Pe 
Ve 



P(t,X,t), 

P e {t,X,t), 

(Q,Xeij 
(P,Xeij 



(P e ,XV), (6.28) 



^4 : = / e t (x) P(x,t) dx, 0,4 := / e t (x) Q(x,t) dx, 1Z 4 := / e t (x) R(x,t) dx, 
Jk Jk Jr 

(6.29) 

where we remind the reader that e t stands for the function e{x—t). Note that 
all quantities in (|6.28|) and (J6.29j) are functions of t and A alone. Furthermore, 
let 



e <p(oo) 



(p(x)dx, Cip = etp(oo) = - I i/j(x)dx. (6.30) 
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Recall from the previous section that when (3 = 4 we take N to be odd. It 
follows that (f and if) are odd and even functions respectively. Thus when 
(3 = 4, Ctp = while computation using known integrals for the Hermite 
polynomials gives 

c, = (ttAO 1 ^- 3 ^ 2 ^ V\. (6.31) 
Hence computation yields 



lim 




(6.32) 



At t — oo, 

u e (oo) = 0, (Ze(oo) = (6.33) 



7^4(00) = -c^, Q4(oo) = -Cy,, 7^4(00) = 0. (6.34) 
In (I6.26j) . L = 3 and if we denote a 4 = (if) ,e t ), then we have explicitly 

<X! = Q e , a 2 = -~ [(S + RS)e t + a 4 Q e ], a 3 = -i [(5 + 1 + (^ , 1 ) Q £ ] , 

/3i = P2 = S t , Pa = 5oo- 

However notice that 

((S + RS)e t ,6 oo ) = (e t ,6 oo ) = 0, ((S + RS) 1,8^) = (1,^) = 

(6.35) 

and (Q t ,800) = c v = 0. Therefore the terms involving @ 3 = 800 are all and 
we can discard them reducing our computation to that of a 2 x 2 determinant 
instead with 

a x = Q e , a 2 = -- [(S + RS) e t + a 4 Q e ] , f3 1 = Xip, f3 2 = 8 t . (6.36) 
Hence 

(ai,Pi)=v e , (ai,(3 2 ) = q e , (6.37) 
(a 2 ,P 1 ) = -- (V 4 - a A + a 4 v e ) , (6.38) 

(a 2 ,(3 2 ) = -- (^ 4 + a 4 g e ). (6.39) 
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We want the limit of the determinant 



det (S jk - ( OLj , fl k ))i<j, fc< 2 > ( 6 - 40 ) 

as A" —>■ oo. In order to get our hands on the limits of the individual terms 
involved in the determinant, we will find differential equations for them first 
as in [23]. Adding a 4 /2 times row 1 to row 2 shows that a 4 falls out of the 
determinant, so we will not need to find differential equations for it. Thus 
our determinant is now 

I \~* ! ~t ni I • (6.41) 

Proceeding as in we find the following differential equations 
d d 

— u e = -q N q £ , j- t Qe = q N ~ q N v t -p N u e , (6.42) 

Jl Qi = + 1) , jft = ~P N (ft 4 + 1) > (6-43) 

—n^-p N Q i -q N V4. (6.44) 

Now we change variable from t to s where t = r(s) = \/2N + s/(y/2 N 1 ^ 6 ) 
and take the limit A" — > oo, denoting the limits of q e , V±, Q±, 7£ 4 , and 
the common limit of u e and v e respectively by q, "P 4 , <2 4 , 7£ 4 and u. Also 
Va and Q 4 differ by a constant, namely <2 4 = P 4 + y/2/2. These limits 
hold uniformly for bounded s so we can interchange hin^^oo and 4-. Also 
limAr^oo N~ l l & q N = lim/vr^oo N~ 1 ^ 6 p N = q , where q is as in (|3.1U|) . We obtain 
the systems 



jLp 4 = __L 9(Rl + 1 ), ^ K4= __Lj 2P4 + y|], (6 .46) 
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The change of variables s — > \i — q(x, \)dx transforms these systems 
into constant coefficient ordinary differential equations 

-2m), (6.47) 



(6.48) 



d 

d \i 


71*' 


d 

d \i 


1 






d jj, 


f 

7! 



d^ 1 ^ d— 1 — /A 



Since lim^oo/i = 0, corresponding to the boundary values at t = oo which 
we found earlier for V^H-^ we now have initial values at fi = 0. Therefore 

u(p = 0) = q{fi = 0) = 0, (6.49) 



p A (ji = 0) = - d 7e 4 (/x = 0) = 0. (6.50) 



2 

We use this to solve the systems and get 



q = —= (e-" - e") (6.51) 



u = ~ 2 [l-- 2 e»--e-») (6.52) 
V* = ^(^*-^e-*-V\) (6.53) 



TZ 4 = — e~ M -l (6.54) 

4 4 

Substituting these expressions into the determinant gives (jSHSJ)? namely 

£4(3, A) = £2(5, A) cosh 2 f J , (6.55) 




where £)g = limjv-»oo -D^,jv- Note that even though there are A-terms in 
f!6.53|) and ()6.54|) . these do not appear in the final result ()6.55|) . making 
it similar to the GUE case where the main conceptual difference between 
the m = 1 (largest eigenvalue) case and the general m is the dependence 
of the function q on A. The right hand side of the above formula clearly 
reduces to the (3 = 4 Tracy- Widom distribution when we set A = 1. Note 
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that where we have -D 4 (s, A) above, Tracy and Widom (and hence many 
RMT references) write -D 4 (s/\/2, A) instead. Tracy and Widom applied the 
change of variable s — > s/\^2 in their derivation in [33] so as to agree with 
Mehta's form of the (3 = 4 joint eigenvalue density, 6 which has — 2x 2 in the 
exponential in the weight function, instead of —x 2 in our case. To switch 
back to the other convention, one just needs to substitute in the argument 
s/y/2 for s everywhere in our results. At this point this is just a cosmetic 
discrepancy, and it does not change anything in our derivations since all the 
differentiations are done with respect to A anyway. It does change conventions 
for rescaling data while doing numerical work though. 

7 The Distribution of the m th Largest Eigen- 
value in the GOE 



7.1 The distribution function as a Predholm determi- 
nant 

The GOE corresponds case corresponds to the specialization /3 = 1 in (|3.1|) 
so that 

rr N N 

Gi ;N (t,X) = C[ N) J ■ ■ ■ j \\\xj - x k \ \\w(xj) JJ (1 + f(xj)) dxi ■ --dxN 

(7-1) 

where w(x) = exp (— x 2 ), f(x) = —A Xj(x), and C[ N ^ depends only on N. As 
in the GSE case, we will lump into C± any constants depending only on iV 
that appear in the derivation. A simple argument at the end will show that 
the final constant is 1. These calculations more or less faithfully follow and 
expand on [H3j. We want to use (|4.2jl . which requires an ordered space. Note 
that the above integrand is symmetric under permutations, so the integral 
is n\ times the same integral over ordered pairs Xi < . . . < x^ . So we can 
rewrite 17.11 as 

r r N N 

(N\) /•••/ Yl(x k -x j )Y[w(x k )Y[0- + f(xk))dx 1 ---dx N , 



X\<...<Xj~ 



j<k 
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where we can remove the absolute values since the ordering insures that 
(xj — Xj) > for % < j. Recall that the Vandermonde determinant is 



A N {x) = det(4 1 ) 1 < J , fc < JV = (-1)^ n Y[{xj -x k ). 

j<k 

Therefore what we have inside the integrand above is, up to sign 

det(^~ 1 w(x k ) (1 + /Ofc)))^.^. 

Note that the sign depends only on N. Now we can use (|4.2JI with 

<Pj(x) = x- 7-1 w(x) (1 + f(x)). 

In using (|4.2j) we square both sides so that the right hand side is now a 
determinant instead of a Pfaffian. Therefore G\ N (t, A) equals 

l<j,k<N 

Shifting indices, we can write it as 
C[ N) det(f I sgn(x-y)x i y k (l + f(x))w(x)w(y)dxdy] (7.2) 

\J J ' l<j,k<N-l 

where is a constant depending only on N, and is such that the right 
side is 1 if / = 0. Indeed this would correspond to the probability that 
jGoe(n) <- ^ or equivalently to the case where the excluded set J is empty. 
We can replace x J and y k by any arbitrary polynomials Pj(x) and Pk(x), of 
degree j and k respectively, which are obtained by row operations on the 
matrix. Indeed such operations would not change the determinant. We also 
replace sgn(x — y) by e(x — y) — \ sgn(x — y) which just produces a factor 

of 2 that we absorb in Cj . Thus G\ N (t, A) now equals 

Cr det (ff <x - y )M y) (1 + /(,)) (1 + H„ «b)***V 

(7-3)" ~ 

Let ipj(x) = Pj(x) w(x) so the above integral becomes 

Cf° det (j j e(x - y) ^(x) Mv) (1 + /(*) + f(v) + fix) f(y)) dxdy 

(7.4) " 
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Partially multiplying out the term we obtain 

+ / / e(x-y)ipj(x)ipk(y)(f(x) + f(y) + f{x) f(y))dxdy 



0<j,k<N-l 

Define 

M= \ 4 X -y)i J j( x )4>k(y)dxdy) , (7.5) 

so that G\ N (t, A) is now 

C[ N) det(M + \ t e{x -y)^{x)^ h {y){f{x) + f{y) + f{x) f{y))dxdy 



0<j,k<N-l 



Let e be the operator defined in (J6.2|) . We can use operator notation to 
simplify the expression for G^ N (t,X) a great deal by rewriting the double 
integrals as single integrals. Indeed 




eO -y)ipj(x)ip k (y) f(x)dxdy = / f(x)ipj(x) J e(x - y)ip k (y) dydx 

f ipjeipkdx. 



Similarly, 

(x - y) ipj(x) ip k (y) f(y) dxdy = - J J e(y - x) ipj{x) ip k {y) f(y) d x dy 

= - f(y)iPk(y) / e(y -x)4> j (x)dxdy 




f(x)ip k (x) J e(x - y)i)j(y)dydx 
fipke^pjdx. 
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Finally, 

e(x - y) i)j{x) ip k (y) f(x) f(y) dxdy 

e(y - x) tpj(x) ip k (y) f(x) f(y)dxdy 
f{y)ipk{y) [ e(y - x) f{x)i)j{x)dxdy (7.6) 
f(x)ip k (x) J e(x-y) f{y)i) j {y)dydx 
= - J f 'ipk^U 'ipj)dx. 

It follows that 

Gl N (t,X) = c[ N) det (m+ I Ifipjefa-ffaefy-ffaeif^dx) 

(777)- 

If we let M 1 = (fijk) 0< k<N t , and factor det(M) out, then G\ N {t, A) equals 
Cf° det(M) det yi+ 

M' 1 ■ ( I [fi(>jeif) k -fi() k eil) j -fif) k e(fi() j )]dx 

(7.8) 

where the dot denotes matrix multiplication of M _1 and the matrix with the 
integral as its (J, fc)-entry. define rjj = . jij k ip k and use it to simplify the 
result of carrying out the matrix multiplication. From (J7.5j) it follows that 
det(M) depends only on iV we lump it into C[ N \ Thus G\ N (t, A) equals 



C[ N) det I I + ( / [/ j]jeij) k - f ij) k e rjj - f i[) k e (/ rjj)} d 



x 



0<j,k<N-l/ < J . fc < JV _ 1 
" (7 - 9) 

Recall our remark at the very beginning of the section that if / = then the 
integral we started with evaluates to 1 so that 

Cf° det (/) = Cf°, (7.10) 
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which implies that C[ N) = I. Now G\ N {t, A) is of the form det(J + AB) 
where A : L 2 (J) x L 2 (J) -> C w is a iV x 2 matrix 



Mi \ 

mw 



whose row is given by 



A j = A i ( x ) = (~f er lj ~ ft (fVj) f Vj) ■ 



Therefore, if 



9 = 



9i 

92 



E L 2 {J) x L 2 (J), 



then A g is a column vector whose j th row is ( Aj , g 



L 2 xL 2 



( A 9)j = J [-feVj-fe(fVj)] 9idx + J fr] j g 2 dx. 

Similarly, B : C N -> L 2 (J) x L 2 (J) is a 2 x N matrix 

B = { Bi B 2 ... 5jv ) , 
whose j" 1 column is given by 

( 



Thus if 



then B h is the column vector of L 2 (J) x L 2 (J) given by 
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Clearly AB : C N ^ C N and B A : L 2 (J) x L 2 (J) -> L 2 (J) x L 2 (J) with 
kernel 

- Ej % ® / e vj - Ej % ® / e (/ '// ) Ej V'j ® / '/. ; 

- Ej< '• './ / ' '/j - E/ '-V ®fe(fVj) Ej ® / '/./ 

Hence I + BA has kernel 

/ - E, ® / e '/, - E, ® / e (/ '/.,) Ej i>3 ® f 'U 
~ Ej e ^j®f e Vj - Ej 6 V'i ® fe(fVj) 1 + E j e V'j ® / '/. ; 

which can be written as 

> E ; '-v /"/, E^<8>/% W / o 
E,"-', /"/, </ / • E,"', /'/, 7 ' V '/ J 

Since we are taking the determinant of this operator expression, and the 
determinant of the second term is just 1, we can drop it. Therefore 



G 2 hN (f,A) = det 



/ - Ej ® / e % Ej ^' ® / '/, 



where 



J 3 3 J I J -J • ^] 

det(/+ 



^ V " E,^ ® £77, - 6 E.-e^ ® % J 



and l^jv has matrix kernel 
We define 

JV-l 

S N (x,y) = -^2 ipj(x)iM jk eip k (y). 

j,k=0 

Since M is antisymmetric, 

JV-l JV-l JV-l 

Sn(v,x) = -^2 i)j(y)^ jk ei) k (x) = ^ ^j(y)^ kj eilj k (x) = e^j(x)fi jk ipk(y)- 

j,k=Q j,k=0 j,k=0 
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Note that 

N-l 

eS N (x,y) = ^ e 4>j( x ) Hjkeipkiy), 

j,k=0 

whereas 

d Ar_1 
-—S N (x,y) = V i/>j(x)ii jk il) k (y). 
dy 

So we can now write succinctly 

K lA x,v)=( "fi ~W'\ V) ) (7.11) 

V eS N (x,y) - e S N {y,x) ' 



So we have shown that 



G ltN {t,X) = yjD ltN (t,A) (7.12) 

where 

Di,N{t, A) = det {I + K 1)N f) 
where is the integral operator with matrix kernel Ki ^(x,y) given in 

(nm. 

7.2 Gaussian specialization 

We specialize the results above to the case of a Gaussian weight function 

w(x) = exp (-x 2 /2) (7.13) 

and indicator function 

f(x) = -\X J J=(t,oo) 

Note that this does not agree with the weight function in ()3.1|) . However it is a 
necessary choice if we want the technical convenience of working with exactly 
the same orthogonal polynomials (the Hermite functions) as in the (3 = 
2, 4 cases. In turn the Painleve function in the limiting distribution will be 
unchanged. The discrepancy is resolved by the choice of standard deviation. 
Namely here the standard deviation on the diagonal matrix elements is taken 
to be 1, corresponding to the weight function (j7.13J) . In the (3 — 2,4 cases 
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the standard deviation on the diagonal matrix elements is l/v2, giving the 
weight function (jfj.fjjl . Now we again want the matrix 

M=i / e(x - y) ipj(x) ip h (y) dxdyj = ( / ipj(x) eip k (x) dx 

^ J ' 0<j,k<N-l ^ ^ 0<j,fc<JV-l 

to be the direct sum of y copies of 

1 
-1 

so that the formulas are the simplest possible, since then fij k can only be 
or ±1. In that case M would be skew-symmetric so that M _1 = —M. In 
terms of the integrals defining the entries of M this means that we would 
like to have 

^2m(x) e^ 2n +l(x) d X = 6 m>n , 

and otherwise _ 

tpj (x) —— tpk(x)dx = 0. 
dx 

It is easier to treat this last case if we replace it with three non-exclusive 
conditions 

fp2m(x) e4>2n(x)dx = 0, 
1p2m+l(x) eif) 2 n+l(x) dx = 

(so when the parity is the same for j, k, which takes care of diagonal entries, 
among others), and 

ipj(x) eijjk(x) dx = 0. 



whenever \j — k\ > 1, which targets entries outside of the tridiagonal. Define 

1 



Vn(x) = —H n (x) exp(-x /2) for c n = \J 2 n n\^fn 
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where the H n are the usual Hermite polynomials defined by the orthogonality 
condition 

J Hj(x) H k (x) e~ x2 dx = c| 5j^- 

It follows that 

J (fj(x) Lp k (x)dx = 5 jtk . 
Now let ^ 

An+l( X ) = -l-<P2n(x) A n ( X )=<P2n(x)- (7.14) 

ax 

This definition satisfies our earlier requirement that ipj = pj w for 

w(x) = exp (— x 2 /2) . 

In this case for example 

P2n{x) = — H 2n (x) . 

With e defined as in (|6.2j) . and recalling that, if D denote the operator that 
acts by differentiation with respect to x, then D e = e D = I, it follows that 

^2 m i X ) ^2„ + l (x)dX 



^2 m ( X ) ^2„ + l 0) d 0) 

as desired. Similarly, integration by parts gives 



/ 



/I 
J^^zm ( x ) e^ 2n (x)dx 

= - ^ ( x ) <^ 2 „ (x)dx 



^2 m ( X ) ¥?2n + l i X ) ^ i X ) 



-- S„ 
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Also ip 2n is even since H 2n and ip 2n are. Similarly, ip 2n+1 is odd. It follows 
that e ip 2n j an d e V^n+i > are respectively odd and even functions. From these 
observations, we obtain 



^2n(x) eip 2m (x)dx = 0, 
since the integrand is a product of an odd and an even function. Similarly 

^2u+i(x)eip 2m+1 (x)dx = 0. 
Finally it is easy to see that if \j — k\ > 1, then 

ip.(x) eijj k (x)dx = 0. 



Indeed both differentiation and the action of e can only "shift" the indices 
by 1. Thus by orthogonality of the ipj, this integral will always be 0. Thus 
by our choice in (|7.14j) . we force the matrix 



M—\j ipj(x) eijjk(x) d x 

0<j,k<N-l 

N 



to be the direct sum of y copies of 

1 
-1 

This means M _1 = — M where M~ l = {fij k}- Moreover, fij^ = if j, k 
have the same parity or \j — k\ > 1, and fi2j,2k+i = $jk — ~^2k+i,2j for 
j, k = , — — 1. Therefore 

N-l 

S N (x,y) - - ^ if>j(x)wl>k(y) 

j,k=0 

N/2-1 N/2-1 

= - ^2j(x)e^2j+i(y) + ^2j+i(x) eip 2j (y) 

3=0 j=0 
' N/2-1 N/2-1 

sr ( x \ (v\ sr d ( x \ (v 

3=0 j=0 
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Manipulations similar to those in the (3 = 4 case (see (|6.8|) through 
yield 



S N {x,y) 
We redefine 



.i=o 



AT-l 



Sjv(z, y) = Y V, ( x ) Pi (y) = S n(v, x), 

3=0 

so that the top left entry of Ki^(x,y) is 

W 



Sn(x, y) + \l — <p N _ x (x) (e tp N ) (y) . 
If Sn is the operator with kernel Sjy(x,y) then integration by parts gives 



S N Df = I S(x,y)-£-f(y)dy 



d 

dy 



S N (x,y) f(y)dy, 



so that —j^Sn(x, y) is in fact the kernel of S^D. Therefore (j7.12|) now holds 
with Kx t N being the integral operator with matrix kernel Ki^{x,y) whose 
(i, j)-entry K[y(x,y) is given by 



K ( ?£\x,y) = e 
K?$(x,y) = 



S N (x, y) + \/ — V N -! 0) (e <p N ) (y) 



SD N (x,y)-j- (y^(f N ^{x)(e<f N ){y) 



S N (x,y) + \j — (x) (eip N ) (y) - 1 



S N (x,y) + \l— (eip N ) (x) </V-j (y) 



Define 



<p(x) 



N\ 1/A 
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so that 




X(x) [S N (x,y) + iff(x)e<p(y)] X(y), 

X(x) [SD N (x,y)-if>(x)<p(y)) X{y), 

X(x) [eS N (x, y) + e i[)(x) e <p(y) - e(x 

X(x) [S N (x,y) + 6<p(x)iP(y)) X(y). 



Note that 



X (eS + e^p® e<p-e) X = Kffi(x, y), 
X(S + ep® eV) X = Kf^\x,y). 



X (S + 4>® ep) X = K[^\x,y), 
X (SD-^j® p) X^K^\x,y), 




Hence 



K 1>N = X 



S + ip® ep SD - -0 ® ip 
eS + eip®ep — e S + ep®eip 



) 



X. 



Note that this is identical to the corresponding operator for (3 = 1 ob- 
tained by Tracy and Widom in [33], the only difference being that ip, ip, and 
hence also S, are redefined to depend on A. 

7.3 Edge scaling 

7.3.1 Reduction of the determinant 

The above determinant is that of an operator on L 2 (J) © L 2 (J). Our first 
task will be to rewrite these determinants as those of operators on L 2 (J). 
This part follows exactly the proof in [33]. To begin, note that 



[S ,D] = <p <g> ip + if) ® (p 



(7.15) 



so that (using the fact that De = eD = I) 



e,S] 



e S — S e 

eSDe-eDSe = e [S,D] e 

ep ®ipe + eip ® pe 
e p ® e t tp + e ip <g> e l <p 
—ep®eip — eip ® ep, 



(7.16) 
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where the last equality follows from the fact that e* = 



— e. We thus have 



D (eS + ei> <S> eip) = S + tp^eip, 
D (eSD-eil)®^) = SD-?p®ip. 

The expressions on the right side are the top entries of K\^. Thus the first 
row of Ki N is, as a vector, 

D (eS + ei/) <g> eip,eS D - eif) <g> <p) . 

Now (J7.16|) implies that 

eS + etp®6(p = Se — etp ® eijj. 

Similarly (|7.15|) gives 

e [S ,D) = eip <g> i/j + eip g) <p, 

so that 

eSD-e?p<g)(p = eDS + eip<$?p = S + e(p<g)tp. 
Using these expressions we can rewrite the first row of K\ y N as 

D (S e — eip^eip.S + eip®^). 
Applying e to this expression shows the second row of K\,n is given by 

(eS — e + ev/j®eip,S + eip®ijj) 
Now use f!7.16|) to show the second row of is 

(S e — e + eip®e^,S + e(p®ip). 

Therefore, 

K = x ( D {Se-eip®e^) D (S + eip®4>)\ 
I Se — e + e<p ® eip S + eip®^ J 

XD \ / (Se-e^®ei))X {S + e^®4))x\ 
X J 1 (Se - e + tip ® tip) X (S + eip ®ip)X J ' 
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Since Ki n is of the form A B, we can use the fact that det(J— A B) = det(J— B A) 
and deduce that Di >N (s, A) is unchanged if instead we take K\,n to be 



N 



(Se-eip®eip)X (S + ecp <g> if)) X \ ( X D 
(Se - e + ecp <g> eip) X (S + ecp (g) ip) X J \ X 

(Se - ecp <g> eip) XD (S + e<p ® ip) X ~ 
(Se-e + ecp®eip)XD (S + ecp ®ip)X 



Therefore 
Di,n(s, X) = det 



I - (Se - eip <g> eip) XXD - (S + ecp <g> ip) XX 
- (Se - e + ecp ® eip) XXD I - (S + ecp <g> ip) XX 

(7.17) 

Now we perform row and column operations on the matrix to simplify it, 
which do not change the Fredholm determinant. Justification of these oper- 
ations is given in [33] ■ We start by subtracting row 1 from row 2 to get 



r -(Se- eip® eip) XXD - (S + eip <g> ip) XX 
-I + eXXD I 

Next, adding column 2 to column 1 yields 

I- (Se-e<p<S>eip) XX D - (S + eip <g> ip) XX - (S + eip®ip) XX 
eXXD I 

Then right-multiply column 2 by —eXXD and add it to column 1, and 
multiply row 2hyS + ecp®ip and add it to row 1 to arrive at 



det 



I-(Se-ecp®eip) XXD + (S + etp®ip) XX (eXXD-I) 

/ 



Thus the determinant we want equals the determinant of 

I-{Se-ecp®eip) XXD + (S + ecp®ip) XX{eXXD-I). (7.18) 

So we have reduced the problem from the computation of the Fredholm 
determinant of an operator on L 2 (J) x L 2 (J), to that of an operator on 
L\J). 
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7.3.2 Differential equations 



Next we want to write the operator in (j7.18|) in the form 

(I-K 2 , N ) \ I-J2<*i®Pi\ , (7-19) 

where the a, and /3j are functions in L 2 (J). In other words, we want to rewrite 
the determinant for the GOE finite dimensional perturbation of the 

corresponding GUE determinant. The Fredholm determinant of the product 
is then the product of the determinants. The limiting form for the GUE part 
is already known, and we can just focus on finding a limiting form for the 
determinant of the finite dimensional piece. It is here that the proof must be 
modified from that in [33] . A little simplification of (J7.18j) yields 

I-XSX-XS (1-AX) eXD-\ [etp g> X^)-\ {eip ® ip) {1 - XX) eXD. 

Writing e [X , D]+X for eX D and simplifying (1 — A X) X to (1 — A) X gives 



/ - A S X - X (1 - A) S X - A (e<p ® Xip) - A (1 - A) (e<p ® Xip) 

- X S (1 - XX) e [X , D] - X (etp <g> ip) (1 - XX) e [X , D] 

= I-{2X-X 2 )SX-{2X-X 2 ) (etp ® Xip) - XS (1 - XX)e [X,D] 

- X{ecp ® ip) (1 - AX)e [X,D] . 

Define A = 2 A — A 2 and let \fx <p — > tp, and — > ip so that A 5 — >■ 5 and 
dZJB) goes to 

/-5X-(e^ ® XV) - ^5(1 - AX)e 

A 

-i(ep <g> V)(l-AX)e [X,£>]. 
A 

Now we define R :— (I — S X)^ 1 S X = (I — S X)~ l — I (the resolvent operator 
of SX), whose kernel we denote by R(x, y), and Q e :— (I — S X)^ 1 etp. Then 
()7.18|) factors into 

A = (I-SX)B. 

where B is 
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I-(Q e <£> X ip) — — (I + R) S (1 — XX) e [X,D] 
X 

-j(Q e ®i/>)(l-\X)e[X,D], A^l. 
X 

Hence 

D hN (s,X) = D 2iN (s,X) det(B). 

Note that because of the change of variable A S — > S, we are in effect factoring 
/ — (2A — A 2 ) S, rather that / — A S as we did in the (3 = 4 case. The fact 
that we factored I — (2A — A 2 ) S X as opposed to / — A S X is crucial here for 
it is what makes B finite rank. If we had factored I — XSX instead, B would 
have been 

2 

B = I-X (S + RS) (I- XX)e k ®8 k -X (I + R) ey? <g> Xip 

k=i 

2 

-A ^(-l) fc (^,(/-AX) e k ) ((I + R) ey)®8 k 
k=i 

-A(l-A) (S + RS) X-A(l-A) ((I + R) e^)®X^ 

The first term on the last line is not finite rank, and the methods we have 
used previously in the (5 — 4 case would not work here. It is also interesting 
to note that these complications disappear when we are dealing with the case 
of the largest eigenvalue; then is no differentiation with respect to A, and we 
just set A = 1 in all these formulae. All the new troublesome terms vanish! 
In order to find det(-B) we use the identity 

2m 

e [X, J D] = ^(-l) fc e fc ®4, (7.20) 
k=i 

where e k and 5 k are the functions e(x — a k ) and 5(x — a k ) respectively, 
and the a k are the endpoints of the (disjoint) intervals considered, J = 
u fcLi( Q 2fe-i, a 2 fc). We also make use of the fact that 

a®b ■ c<g> d= (b ,c) • a® d (7-21) 

where ( . , . ) is the usual L 2 -inner product. Therefore 
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2m 



(Q e ®V)(l-AX)e [X,L>] =^(-l) fe g e ®^-(l-AX)e fc ®4 

fc=i 

2m 

= ^(-l) fe (V,(l-AX)e fc ) g e ® <f fc . 



fc=l 

It follows that 

D 2 , N (s,\) 

equals the determinant of 

> 2m 

I-Q e ®Xil>-j ^(-l) fc [{S + R S) (1 - XX) e k + (ip , (I - XX) e k ) Q e ]®5 k . 
X k=i 

(7.22) 

We now specialize to the case of one interval J = (t, oo), so m — 1, a\ — t 
and a 2 = oo. We write e t = ei, and = e 2 , and similarly for Writing 
the terms in the summation and using the facts that 

too = ~\ (7.23) 

and 

(1 - A X) e t = -- (1 - AX) + (1 - A X) X, (7.24) 

then yields 

I-Q e ®Xip- A [(5- + J R5-)(i-AX) + (^,(l-AX)) Q £ ] (8) (5* - 5oo) 
2A 

+-[(S , + i?S , )(l-AX)X+(^,(l-AX)X) Q e ]®<s t 
A 

which, to simplify notation, we write as 

/ - Q e <g> - -4 [(S 1 + R S) (1 - A X) + a ltX Q e \ ® (S t - 
2 A 

+i [(5 1 + i? S 1 ) (1 - A X) X + a M Q e ] <g> <J t , 
A 
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where 

o liA = (^,(l-Ax)), oi >A = (V,(l-AX)X). 
Now we can use the formula: 



det ^1 - ^2 a i ® A J = det (^. 



'jjfe - («j , A )) 



l<j,k<L 



In this case, L = 3, and 



A 



ai=Qe, a 2 = j [(S + RS) (1 - AX) + a 1>x Q e ] 
X 

a 3 = - t [(3 + RS) (1 - A X) X + o liA QJ , 
A 

(3i = Xip, p 2 = 8 t - Soo, fa = 8 t . 
In order to simplify the notation, define 



(7.25) 



(7.26) 



(7.27) 



Q{x,X,t) := (I-SX)' 1 ^, P{x,X,t) := (I-SX)- 1 ^, 
Q £ (x,X,t) := (I-SX^eip, P £ (x,\,t) := (I - S X)' 1 e i/j , (7.28) 



Vr 



Q(t,X,t), p N 

Qe(t,X,t), p e 

(Q ,Xe(p) = (Q € ,Xip) , v € 

(P,Xe(p) = {Q e ,X<p), w e 



P{t,X,t), 
P e {t,X,t), 
(Q,XeV) 
(P,XeV) 



(P e ,XV), (7.29) 



J(l-XX)Pdx, V 1>x := XX)XPdx, 



Q ljA := (l-XX)Qdx, 



Q liA := / (1 - AX)XQdx, 



fti, A := y (1 - XX)R{x,t)dx, Ki,\:= J (1- XX)XR(x,t)dx. (7.30) 

Note that all quantities in (|7.29|) and (J7.3U)) are functions of t alone. Fur- 
thermore, let 



c v = eyj(oo) 



2 POO ^ P°° 

- (p(x)dx, c^ = eip(oo) = - 



ip(x)dx. (7.31) 
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Recall from the previous section that when (3 = 1 we take N to be even. It 
follows that <f and ip are even and odd functions respectively. Thus = 
for /3 = 1, and computation gives 



C v = (7T 



jV)l/4 2 



-3/4- AT/2 



(AT!) 



1/2 



(JV/2)! 



Hence computation yields 



and at t = oo we have 



lim c<. 



u e (oo) = 0, g e (oo) = 

Pi,a(oo) = 2c^, Qi,a(oo) = 2c v , 7£i jA (oo) 
A,a(oo) = Qi,a(oo) = £i A (oo) = 0. 



Hence 



>i , A ) 
>2 , A ) 

>2 , A ) 

>2 , A ) 

>3 , A ) 

>3 , A ) 

>3 , A ) 



Ql ,/3 2 



^ [Pi,A-ai,A(l-^)], 

Z A 
2 A 



A 

A 
A 

A 
A 

A 



Pl,A - 5l,A (1 - «e) 



(7.32) 



(7.33) 



(7.34) 
(7.35) 

(7.36) 

(7.37) 

(7.38) 

(7.39) 

(7.40) 



As an illustration, let us do the computation that led to ()7.36|) in detail. As 
in [33], we use the facts that S* = S, and (S + SB 1 ) X = B which can be 
easily seen by writing B = YlkLi{S . Furthermore we write B(x,a>k) to 
mean 

Hm B(x,y). 
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In general, since all evaluations are done by taking the limits from within J, 
we can use the identity X 8k = Sk inside the inner products. Thus 

(a 2 ,(3 2 ) = t [((S + RS)(l-XX),8t-6 00 )+a 1 ,x ( Q e , tf* - )] 
A 

= j [( (1 - A X) , (S + R* S) (5 t - 5oo) ) + a 1)A (Q e {t) ~ Qe(°°))] 
A 

= ~[({l-\X),{S + R t S)X{6 t - 6 00 )) + a liA (q e - C(p )] 
A 

= ~ [( (1 - A X) , R(x, t) - R(x, oo) ) + ai, A (q e - c 9 )\ 
A 

= -~ [Ri,\{t) - Ri,x(oo) + a liA (q e - c^,)] 
A 

= y [^-i,a(*) + Ol,A (& - . 
A 



We want the limit of the determinant 

det (5 jk - (uj,^))^^. 



(7.41) 



as N — > oo. In order to get our hands on the limits of the individual terms 
involved in the determinant, we will find differential equations for them first 
as in jHSI- Row operation on the matrix show that a\ t \ and a^x fall out of the 
determinant; to see this add A a\^/{2 A) times row 1 to row 2 and Acz^a/A 
times row 1 to row 3. So we will not need to find differential equations for 
them. Our determinant is 



/ 1 



det 



Vr 



\ 



_ypi2 

2A 

APl,A 

A 



1 



{Qe ~ C<p) -Qe 



2A 
A7^i,a 
A 



1 + 



2A 

A£i,a 
A 



\ 



/ 



7.42) 
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Proceeding as in we find the following differential equations 
d 



dt 



Ue = q N q e , 



^Qi,a = q N (A - TZi,\) 



d_ 
di 

d „ 
di U ^ 



q„ A-l-ft 





d 




dt 




dt 




d * 
dt 


l,XJ ? 


d * 
dt 



Qe = q N ~ q N v e -p N u e , 



-p n Qi,x - q N V 



1,A, 



-1,A 



(7.43) 
(7.44) 
(7.45) 
(7.46) 



Let us derive the first equation in ()7.44j) for example. From [3T] (equation 
2.17), we have 

-R(x,t)q N . 



dt 



Therefore 

dQi,x 
dt 



d 

It 



oo 

t 



Q(x, t) dx — (1 — A) / Q(x,t)dx 



*dQ 

dx 

OO 



dt 



q N ~q N 



x q N -q N 



R(x, t) dx — (1 — A) q N + (1 — A) q N \ R(x,t)dx 
(1 - X)R(x,t)dx 



= >>q N -q N fti,A = q N (A - Ui,x) . 

Now we change variable from t to s where t = t(s) = 2 a yN Then we 

take the limit N — > oo, denoting the limits of q e , Vi t \, Qi,x, T^i,x, ^i,a> Qi,x, T^i,x 
and the common limit of u e and v e respectively by q, V\,\, Qx,x, 7^i,a> ^i.a, Qi,x, T^i,x 
and u. We eliminate Qi t x and Q\ t x by using the facts that Qi^ = Vi,\+X V% 

and Qi,x — *P\,\- These limits hold uniformly for bounded s so we can inter- 
change lim and j-. Also limA^oo A^ -1 / 6 ^ = liniAr^oo N~ x ^p N = q , where q 
is as in ()3.10|) . We obtain the systems 



d_ 

d s 



(7.47) 
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(7.48) 



ds' i,A V ' 7 ' ' v/2 



^,a = 9 (l - A - 7^) , = - ? 72^,, (7.49) 

The change of variables s — > fx — q(x, \) dx transforms these systems 
into constant coefficient ordinary differential equations 

7? = T2~ q - ^ 5 = -71 (1 - 2s) ' (7 - 50) 



^.^-L^-A), ^ = -1(2^^, (7.51, 



*-V»—±(l-X-K»), = (7.52) 



Since lim^oo/i = 0, corresponding to the boundary values at t = oo which 
we found earlier for Vi t \, Vi t \, TZ± t x, we now have initial values at /i — 0. 
Therefore _ _ 

7\a(0) = ft M (0) = Pi, A (0) = ft M (0) = 0. (7.53) 

We use this to solve the systems and get 

_V\-i u Vx + i_ u i /r7 . 

u=^—e» — e " + -, (7.55) 

— Va-a u \/a + a _ u , 

^i,a = 2 e 2 e + A ' (7 ' 57) 

? M = ^ (e" - e""), I M = ^ (e" + e"" - 2). (7.58) 
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Substituting these expressions into the determinant gives (|3.12|) . namely 

- A - 1 - cosh/x(s, X) + y/X sinh/i(s, A) 
Di(s, X) — D 2 {s, X) — , (7.59) 



where Dp = limjv-*oo Dp^. As mentioned in Section I2TT| the functional form 
of the (3 = 1 limiting determinant is very different from what one would 
expect, unlike in the (3 = 4 case. Also noteworthy is the dependence on 
A = 2 A — A 2 instead of just A. However one should also note that when A is 
set equal to 1, then A = A = 1. Hence in the largest eigenvalue case, where 
there is no prior differentiation with respect to A, and A is just set to 1, a 
great deal of simplification occurs. The above formula then nicely reduces to 
the (3 = 1 Tracy- Widom distribution. 

8 An Interlacing Property 

The following series of lemmas establish Corollary ()3.15j) : 
Lemma 8.1. Define 



d? / A 



aj ~ dXi V 2- A 



5.1) 



A=l 



Then cij satisfies the following recursion 



1 if 3 = 0, 

a j = { U ~ 1) %-i f or 3 > 1, 3 even, (8.2) 
jdj-i forj > 1, j odd. 



Proof. Consider the expansion of the generating function /(A) = y 7^ 
around A = 1 

/w = Eif( A - 1 ) i = E 6 i( A - 1 ) i - 

i>o J ' j>o 

Since cij = j \ bj, the statement of the lemma reduces to proving the following 
recurrence for the bj 

1 if j = 0, 

h i= { 3 ~J~ for 3 > !> 3 even, (8.3) 

for j > 1, j odd. 
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Let 



r en {x) = _ 





These are the even and odd parts of / relative to the reflection A — 1 
— (A — 1) or A — ► 2 — A. Recurrence ()8.3|) is equivalent to 

A. r en {x) = {x _ 1) _d_ f o dd{xl 

which is easily shown to be true. 
Lemma 8.2. Define 

/(a, A) 
for A = 2A- A 2 . Then 

Q2n 



2- A 



tanh 



□ 



(8.4) 



2 1 1 



d 



,2 n+1 



A=l 



2n+ 1 d\ 2n+1 



f(s,X) 



A=l 



1 




ifn = 0, 
ifn>l. 



J.5) 



Proof. The case n = is readily checked. The main ingredient for the general 
case is Faa di Bruno's formula 



g(h(t)) = 



ni 



hl-'-kJ \dh k 



(h(t)) 



1 dh\ kl 



- 



1! dt J 



1 d n h 
n\ dt n 



(8.6) 



where k = Y^7=i ^ anc ^ ^ ne a bove sum is over all partitions of n, that is 
all values of k±, . . . , k n such that Y^h=i ih = n. We apply Faa di Bruno's 



formula to derivatives of the function tanh 



which we treat as some 



function g(A(A)). Notice that for j > 1, jyj | is nonzero only when j = 2, 
in which case it equals —2. Hence, in (18. 6|) . the only term that survives is 
the one corresponding to the partition all of whose parts equal 2. Thus we 
have 



2n-k 



A=l 







(-l)"-i(2n-*)l Op tanh /£M) 



if k = 2j + 1, j > 
for k = 2j, j > 



A=l 
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d 2n - k+1 n(s,X) 
tanh PV - ; 



Q X 2n- 



-1-Jfe 



A=l 







(-l)"-J(2n+l-fc)! 



tanh 



ifk = 2j, j > 
for k = 2j + 1, j > 



A=l 



(n-j)! 9A"-J 

Therefore, recalling the definition of aj in (jH.lj) and setting k = 2 j, we obtain 

2n 



5 : 



2ti 



<9A 2n 



/(a, A) 



A=l 



E 

fc=0 
n 

E 

j=0 



2n\ <9 A 



tanh PV - ; 



k J d\ k V 2- A <9A 2 "" fc 



(2j)!(n-j)! 2J ,9A^ 2 



A=l 



A=l 



Similarly, using k = 2 j + 1 instead yields 



02 n+l 

<9A 2n+1 



/(a, A) 



A=l 



2 n+l 

E 

k=0 



2n + l\ d 



k J d\ k V 2- A <9A 2 " +1 



tanli v - 1 



(2» + i)t M!( -" , : a "^ 



A=l 

/i(s, A) 



i=o 
9 2 " 



(2j)!(n-j)! 2j + ld\"-i 



A=l 



(2n + 1) 9A^ /(5 ' A) 



A=l 



since a 2j+1 /(2j + 1) = a 2j . Rearranging this last equality leads to (|8.5|) . □ 
Lemma 8.3. Let Di(s, A) and D^s, A) &e as in (jH.12j) and Then 

~ \ 2 

//(s, A) 



D 1 (s J A) = D 4 (s,A) 1 



A 

2^A 



tanh ■ 



(8.7) 



Proof. Using the facts that —1 — coshx = —2 cosh | , 1 = cosh x — sinh x 
and sinh x = 2 sinh f cosh | we get 
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-1 - cosh fi(s, A) T\ A + VA sinh /i(s, A) 

L>i(s,A) = — D 2 (s,X) + D 2 {s,\) — 



A-2 



A-2 



A-2 



+D 2 (s, A) 



A 



cos h 2 _ sinh 2 M|A) j + ^ sinh/i ( S) a) 
A-2 



= D 4 (s,~\) + 



D 4 (s, A) A sinh 2 ^ - v 7 ! sinh /i(s, A) 



cosh' 



2- A 



= D 4 (s,~\) 1 



(^) 

A sinh 2 ^ - 2 v 7 ! sinh cosh 



(2 - A) cosh 2 



A 



= D A (s, A) I 1 - 2 tanh ^"J " 7 + tanh 2 



2- A 



/i(s, A) A 



/i(s, A) 



= Dt(s,\) 1 



A tanh^- A) 



2 2-A 

2 



2-A 



□ 



1 /o 

For notational convenience, define d\(s, A) = Ly (s, A), and rf 4 (s, A) 
£>4 /2 (s,A). Then 



Lemma 8.4. For n > 0, 



(2n + l)\ d\ 2n+l (2n)\ d\ 2n 



di(s, A) 



-l) n <9 n 



A=l 



n! d\ n 



rf 4 (s, A) 



A=l 



Proof. Let 



/(*,A) = 1 



A , u(s, A) 
tanh PV ' ; 



2-A 



by the previous lemma, we need to show that 



1 d 2n+1 1 d : 



,2n 



+ 



(2n+l)\ d\ 2n + l (2n)\ d A 2 ' 
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d 4 (s, A) /(s, A) 



(8.8) 



A=l 



equals 



-l) n d n 

— d 4 (s, A) 



n\ dX 

Now formula ()8.6|) applied to d 4 (s, A) gives 
d k 

A=l 



A=l 



dX k 
Therefore 







if k = 2j + l,j> 0, 



I Q2n+1 
~, TT TTTT, TT dJs, A) f(s, A) 

(2ti+1)I dA 2n+1 v ' JJK ' 



(2n + 1)! 

n 

-E 



A=l 
2n+l 

E 

fe=0 

;_l)i 



2n+l\ 9 fc ^n+l-fc 

"4 5 ,o n+ i_ fe / 



— (Z4 



dA fc dA 2 

02n-2j+l 



A=l 



3=0 



(2n-2j + d~\i d\ 2n -*i+ 



if 



A=l 



Similarly, 

1 <9 2 " 



(2n)! <9A 2 ' 



d 4 (s, A) /(s, A) 



A=l 



(2n)! 



E 

i=o 



2n 

E 

k=0 



2n\ d k 



2 n—k 



k J d\ k dX 2 



n—k 



f 



A=l 



-iy 



Qj Q2n-2j 



(2n-2j)\ j\ d\i dX 2n ~ 2 i 



A=l 



Therefore the expression in (|8.8|) equals 



E 

3=0 



(-iy 



(2n-2j)\j\ d\i 



d j 

— d 4 (s, A) 



Q2n-2j 



if 



1 



Q2n-2j+l 



d\2n-2j-> 2n-2j + l dX 2n ~ 2 i- 



1/ 



A=l 



Now Lemma 18.21 shows that the square bracket inside the summation is 
zero unless j = n, in which case it is 1. The result follows. □ 

In an inductive proof of Corollary 13. 15[ the base case F 4 (s, 2) = Fi(s, 1) is 
easily checked by direct calculation. LemmaEHestablishes the inductive step 
in the proof since, with the assumption F±(s,n) = Fx(s, 2n), it is equivalent 
to the statement 

F 4 (s,n + 1) =F 1 (s,2n + 2). 
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9 Numerics 



9.1 Partial derivatives of q(x,X) 
Let 

(9.1) 



Qn(x) = — g(x,A) 



A=l 



so that qo equals q from (J3.8j) . In order to compute Fp(s, m) it is crucial to 
know q n with < n < m accurately. Asymptotic expansions for q n at — oo 
are given in [ST]. In particular, we know that, as t — > +oo, qo(—t/2) is given 

by 

1 r ( 1 73 10657 13912277 n ( 1 
-Viil--- — -—- — - — + — 



2 V t3 2t& 2t9 stl2 \ tl5 

whereas q\(—t/2) can be expanded as 

exp(±t 3 / 2 ) / 17 1513 850193 407117521 n 

1 + 77T77777T + TT^TTTTTT + ^77777777 ^ E ^. g + O 



2 v / 2^t 1 /4 V 24i3/2 2 7 3 2 ^ 3 2 10 3 4 t 9 / 2 2 15 3 5 t 6 U 15/2 
These expansions are used in the algorithms below. 



9.2 Algorithms 

Quantities needed to compute Fp(s, m),m — 1, 2, are not only q and qi but 
also integrals involving q , such as 

/oo poo 
(x — s) qo(x) dx, Jo — qo(x)dx. (9.2) 

Instead of computing these integrals afterward, it is better to include them 
as variables in a system together with q , as suggested in J2Ej- Therefore 
all quantities needed are computed in one step, greatly reducing errors, and 
taking full advantage of the powerful numerical tools in MATLAB™ . Since 

POO 

I 'o = - Qoi x ) dx i J o = 9o> J 'o = -9o, (9.3) 

J s 

the system closes, and can be concisely written 



/ 


go 


\ 


/ 


<7o 


\ 




Q'o 






sq + 2ql 






Io 






r* 






I'o 










V 


Jo 


J 


V 


-qo 


I 



(9.4) 
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We first use the MATLAB™ built-in Runge-Kutta-based ODE solver ode45 
to obtain a first approximation to the solution of (|9.4|) between x = 6, and 
x = —8, with an initial values obtained using the Airy function on the right 
hand side. Note that it is not possible to extend the range to the left due to 
the high instability of the solution a little after —8. (This is where the tran- 
sition region between the three different regimes in the so-called "connection 
problem" lies. We circumvent this limitation by patching up our solution 
with the asymptotic expansion to the left of x = —8.) The approximation 
obtained is then used as a trial solution in the MATLAB™ boundary value 
problem solver bvp4c, resulting in an accurate solution vector between x = 6 
and x = —10. Similarly, if we define 



h = (x — s) qo(x) qi(x) d x, J\ 

J s 

then we have the first-order system 



q (x) qi(x)dx, 



(9.5) 



d 

ds 



h 



( 



V 



A 

qoqi 

-qo qi 



(9.6) 



which can be implemented using bvp4c together with a "seed" solution ob- 
tained in the same way as for q . 

The MATLAB™ code is freely available, and may be obtained by con- 
tacting the first author. 
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9.3 Tables 





Statistic 


Eigenvalue 




a 


7i 


72 


Ai 


-1.771087 


0.901773 


0.224084 


0.093448 


A 2 


-3.675440 


0.735214 


0.125000 


0.021650 



Table 1: Mean, standard deviation, skewness and kurtosis data for first two 
edge-scaled eigenvalues in the (3 = 2 Gaussian ensemble. Compare to Table 1 
in [33]. 





Statistic 


Eigenvalue 


n 


a 


7i 


72 


Ai 


-1.206548 


1.267941 


0.293115 


0.163186 


A 2 


-3.262424 


1.017574 


0.165531 


0.049262 


A 3 


-4.821636 


0.906849 


0.117557 


0.019506 


A 4 


-6.162036 


0.838537 


0.092305 


0.007802 



Table 2: Mean, standard deviation, skewness and kurtosis data for first four 
edge-scaled eigenvalues in the (3 = \ Gaussian ensemble. Corollary 13.151 
implies that rows 2 and 4 give corresponding data for the two largest eigen- 
values in the (3 = 4 Gaussian ensemble. Compare to Table 1 in keeping 
in mind that the discrepancy in the (3 = 4 data is caused by the different 
normalization in the definition of F^(s, 1). 
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100 x 100 


100 x 400 


Fx-Percentile 


Ai 


A 2 


A 3 


Ai 


A 2 


A 3 


0.01 


0.008 


0.005 


0.004 


0.008 


0.006 


0.004 


0.05 


0.042 


0.033 


0.025 


0.042 


0.037 


0.032 


0.10 


0.090 


0.073 


0.059 


0.088 


0.081 


0.066 


0.30 


0.294 


0.268 


0.235 


0.283 


0.267 


0.254 


0.50 


0.497 


0.477 


0.440 


0.485 


0.471 


0.455 


0.70 


0.699 


0.690 


0.659 


0.685 


0.679 


0.669 


0.90 


0.902 


0.891 


0.901 


0.898 


0.894 


0.884 


0.95 


0.951 


0.948 


0.950 


0.947 


0.950 


0.941 


0.99 


0.992 


0.991 


0.991 


0.989 


0.991 


0.989 



Table 3: Percentile comparison of F\ vs. empirical distributions for 100 x 100 
and 100 x 400 Wishart matrices with identity covariance. 
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